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Increased Lysine Production by Gene Amplification 



Inventors: Hanke, Paul D. 

Li-D'Elia, Lhing-Yew 
Rayapati, John 

Cross-Reference To Related Applications 

The present application claims benefit to the filing dates of U.S. 
Provisional Application No. 60/184,130, filed February 22, 2000; and U.S. 
Provisional Application No. 60/173,707, filed December 30, 1999, each ofwhich 
is herein incorporated by reference. 

Background of the Invention 

Field of the Invention 

The invention relates to the areas of microbial genetics and recombinant 
DNA technology. The invention provides gene sequences, vectors, 
microorganisms, promoters and regulatory proteins useful for the production of 
L-lysine. The invention further provides a method to increase the production of 
L-lysine. 

Related AH 

L-lysine is an important economic product obtained principally by 
industrial-scale fermentation utilizing the Gram positive Corynebacterium 
glutamicwn, Brevibacterium flavum and Brevibacterium lactofermentum 
(Kleemann, A., et. al, Amino Acids, in Ullmann'S ENCYCLOPEDIA OF 
Industrial Chemistry, vol. A2, pp. 57-97, Weinham: VCH-Verlagsgesellschaft 
(1985)). 

The stereospecificity of the amino acids produced by fermentation makes 
the process advantageous compared with synthetic processes; generally L-form 



amino acids are produced by the microbial fermentation process. Theproduction 
of L-lysine and other amino acids through fermentation, utilizing cheap carbon 
sources such as molasses, glucose, acetic acid and ethanol, is a relatively 
inexpensive means of production. 

Microorganisms employed in microbial processes for amino acid 
production may be divided into 4 classes: wild-type strain, auxotrophic mutant, 
regulatory mutant and auxotrophic regulatory mutant (K. Nakayama et ah, in 

Nutritional Improvement of Food and Feed Proteins, M. Friedman, ed., 

(1978), pp. 649-661). 

Several fermentation processes utilizing various strains isolated for 
auxotrophic or resistance properties are known in the art for the production of 
L-lysine: U.S. Patent No. 2,979,439 discloses mutants requiring amino acid 
supplementation (homoserine, or L-methionine and L- threonine); U.S. Patent 
No. 3,700,557 discloses mutants having a nutritional requirement for L- 
threonine, L-methionine, L-arginine, L-histidine, L-leucine, L-isoleucine, L- 
phenylalanine, L-cystine, or L-cysteine; U.S. Patent No. 3,707,441 discloses a 
mutant having a resistance to an L-lysine analog; U.S. Patent No. 3,687,810 
discloses a mutant having both an ability to produce L-lysine and a resistance to 
bacitracin, penicillin G or polymyxin; U.S. Patent No. 3,708,395 discloses 
mutants having a nutritional requirement for homoserine, L-threonine, L- 
threonine and L-methionine, L-leucine, L-isoleucine or mixtures thereof and a 
resistance to L-lysine, L-threonine, L-isoleucine or analogs thereof; U.S. Patent 
No. 3,825,472 discloses a mutant having a resistance to an L-lysine analog; U.S. 
Patent No. 4,169,763 discloses mutant strains of Corynebacterium that produce 
L-lysine and are resistant to at least one of aspartic analogs and sulfa drugs; U.S. 
Patent No. 5,846,790 discloses a mutant strain able to produce L-glutamic acid 
and L-lysine in the absence of any biotin action-suppressing agent; and U.S. 
Patent No. 5,650,304 discloses a strain belonging to the genus Corynebacterium 
or Brevibacterium for the production of L-lysine that is resistant to 
4-N-(D-alanyl)-2,4-diamino-2,4-dideoxy-L-arabinose2,4-dideoxy-L-arabinose 
or a derivative thereof. 



A considerable amount is known regarding the biochemical pathway for 
L-lysine synthesis in Corynebacterium species (recently reviewedby Sahm etal, 
Ann.N. Y.Acad.Sci. 752:25-39(1996)). Entry into the L-lysine pathway begins 
with L-aspartate (see Figure 1), which itself is produced by transamination of 
oxaloacetate. A special feature of C. glutamicum is its ability to convert the L- 
lysine intermediate piperidine 2,6-dicarboxylate to diaminopimelate by two 
different routes, i.e. by reactions involving succinylated intermediates or by the 
single reaction of diaminopimelate dehydrogenase. Overall, carbon flux into the 
pathway is regulated attwo points: first, through feedback inhibition of aspartate 
kinase by the levels of both L-threonine and L-lysine; and second through the 
control of the level of dihydrodipicolinate synthase. Therefore, increased 
production of L-lysine may be obtained in Corynebacterium species by 
deregulating and increasing the activity of these two enzymes. 

More recent developments in the area of L-lysine fermentative production 
in Corynebacterium species involve the use of molecular biology techniques to 
augment L-lysine production. The following examples are provided as being 
exemplary of the art: U. S. Patent Nos. 4,560,654 and 5,236,831 disclose an 
L-lysine producing mutant strain obtained by transforming a host 
Corynebacterium or Brevibacterium species mi croorganism whi ch is s ensiti ve to 
S-(2-aminoethyl)-cysteine with a recombinant DNA molecule wherein a DNA 
fragment conferring both resistance to S-(2-aminoethyl)-cysteine and L-lysine 
producing ability is inserted into a vector DNA; U. S. Patent No. 5,766,925 
discloses a mutant strain produced by integrating a gene coding for 
aspartokinase, originating from coryneform bacteria, with desensitized feedback 
inhibition by L-lysine and L-threonine, into chromosomal DNA of a 
Corynebacterium species bacterium harboring leaky type homoserine 
dehydrogenase or a Corynebacterium species deficient in homoserine 
dehydrogenase gene; increased L-lysine production is obtained by gene 
amplification by way of a plasmid vector or utilizing a gene replacement 
strategy. European Patent Applications EP 0 81 1 682 A2 and EP 0 854 189 A2 
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both provide for increased production of L-lysine in Corynebacterium species by 
way of gene amplification based on plasmid copy number. 

Summary of the Invention 

It is an object of the invention to provide a method to increase the 
production of an amino acid in Corynebacterium species by amplifying, i.e., 
increasing, the number of a gene or genes of an amino acid biosynthetic pathway 
in a host cell. Particularly preferred Corynebacterium species include 
Corynebacterium glutamicum, Brevibacterium flavum, and Brevibacterium 
lactofermentum. 

It is an object of the invention to provide an isolated feed back resistant 
aspartokinase enzyme wherein the naturally occurring threonine amino acid 
residue 380 in the feedback sensitive form is changed to isoleucine in the ask 
gene of ATCC 21529. It is an object of the invention to provide an isolated ask 
polypeptide comprising the amino acid sequence of SEQ ID NO: 2. It is another 
object of the invention to provide an isolated polynucleotide molecule 
comprising anucleotide sequence encoding the polypeptide sequence of SEQ ID 
NO:2. It is another object of the invention to provide an isolated polynucleotide 
molecule comprising a nucleic acid having the sequence of SEQ ID NO: 1 . 

It is another object of the invention to provide a method comprising 
transforming a Corynebacterium species host cell with apolynucleotide molecule 
comprising anucleotide sequence encoding a polypeptide comprising amino acid 
SEQ ID NO:2, wherein said isolated polynucleotide molecule is integrated into 
said host cell's chromosome thereby increasing the total number of said amino 
acid biosynthetic pathway genes in said host cell chromosome, and selecting a 
transformed host cell. It is a further object of the invention to provide a method 
comprising screening for increased amino acid production. The method may 
further comprise growing said transformed host cell in a medium and purifying 
an amino acid produced by said transformed host cell. 
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In another embodiment, a method to increase the production of an amino 
acid is a method comprising transforming a Corynebacterium species host cell 
with an isolated nucleic acidmolecule encoding the amino acidsequence of SEQ 
ID NO:2, wherein said isolated nucleic acidmolecule is integrated into saidhost 
5 cell's chromosome thereby increasing the total number of said amino acid 

biosynthetic pathway genes in said host cell chromosome, and wherein said 
isolated nucleic acid molecule further comprises at least one of the following: a 
polynucleotide encoding a Corynebacterium species lysine pathway asd amino 
acid sequence; a polynucleotide encoding a Corynebacterium species lysine 

10 pathway dapA amino acid sequence; a polynucleotide encoding a 

Corynebacterium species lysine pathway dapB amino acid sequence; a 
polynucleotide encoding a Corynebacterium species lysine pathway ddh amino 
acid sequence; a polynucleotide encoding a Corynebacterium species lysine 
pathway 'lysA amino acid sequence; a polynucleotide encoding a 

15 Corynebacterium species lysine pathway lysA amino acid sequence; a 

polynucleotide encoding a Corynebacterium species lysine pathway ORF2 amino 
acid sequence, and selecting a transformed host cell. The method may further 
comprise growing said transformedhost cell in a medium and purifying an amino 
acid produced by said transformed host cell. 

20 The term " 'lysA " refers to a truncated lysA gene or amino acid sequence 

used by Applicants and described infra. The term "lysA " refers to the full length 
lysA gene or amino acid sequence used by Applicants and described infra. 

It is another object of the invention to provide an isolated polynucleotide 
molecule comprising a nucleic acid molecule encoding the Corynebacterium 

25 glutamicum lysine pathway ask amino acid sequence of SEQ ID NO: 2; and at 

least one additional Corynebacterium species lysine pathway gene selected from 
the group consisting of a nucleic acid molecule encoding the asd polypeptide, a 
nucleic acid molecule encoding the dapA polypeptide, a nucleic acid molecule 
encoding the dapB polypeptide, a nucleic acid molecule encoding the ddh 

30 polypeptide, a nucleic acid molecule encoding the 'lysA polypeptide, a nucleic 

acid molecule encoding the lysA polypeptide and a nucleic acid molecule 
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encoding the ORF2 polypeptide. In a preferred embodiment of the invention, the 
isolated polynucleotide molecule comprises pK184-KDABH'L. In another 
preferred embodiment of the invention, the isolated nucleic acid molecule 
comprises pKl 84-KDAB. In another preferred embodiment of the invention, the 
5 isolated nucleic acid molecule comprises pD2-KDABHL. In another preferred 

embodiment of the invention, the isolatednucleic acidmolecule comprises pD 11- 
KDABH'L. 

It is another object of the invention to provide a host cell transformed 
with an isolated polynucleotide molecule comprising a nucleotide sequence 

1 0 encoding an isolatedpolypeptide comprising the amino acid sequence of SEQ ID 

NO: 2, wherein the isolatednucleic acidmolecule is integrated into the host cell's 
chromosome thereby increasing the total number of amino acid biosynthetic 
pathway genes in the host cell chromosome. In one embodiment the 
polynucleotide further comprises at least one additional Corynebacterium species 

15 lysine pathway gene selected from the group consisting of: a nucleic acid 

molecule encoding an as d polypeptide; anucleic acidmolecule encoding a dapA 
polypeptide; anucleic acidmolecule encoding a dapB polypeptide; anucleic acid 
molecule encoding a ddh polypeptide; a nucleic acid molecule encoding a 'lysA 
polypeptide; a nucleic acid molecule encoding a lysA polypeptide; and a nucleic 

20 acid molecule encoding an ORF2 polypeptide. 

In another embodiment, the polynucleotide further comprises a nucleic 
acid molecule encoding a polypeptide wherein said asd polypeptide is SEQ ID 
NO:4; said dapA polypeptide is SEQ ID NO:6; said dapB polypeptide is SEQ ED 
NO: 8; said ddh polypeptide is SEQ ID NO: 10; said 'lysA polypeptide is SEQ ED 

25 NO: 2 1 ; said lysA polypeptide is SEQ ID NO: 1 4; and said ORF2 polypeptide is 

SEQ ID NO: 16. 

In another embodiment, the polynucleotide further comprises a nucleic 
acid molecule wherein said asd polypeptide is SEQ ED NO:4; said dapA 
polypeptide is SEQ ID NO:6; said dapB polypeptide is SEQ ED NO:8; said ddh 
30 polypeptide is SEQ ID NO: 10; said 'lysA polypeptide is SEQ ID NO:21; said 
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lysA polypeptide is SEQ ID NO: 14; and said ORF2 polypeptide is SEQ ID 
NO: 16. 

In another embodiment, the polynucleotide further comprises a nucleic 
acid molecule encoding the asd amino acid sequence of SEQ ID NO: 4; a nucleic 
5 acid molecule encoding the dapA amino acid sequence of SEQ ID NO:6; a 

nucleic acidmolecule encoding the dapB amino acidsequence of SEQ ID NO:8; 
and a nucleic acid molecule encoding the ORF2 amino acid sequence of SEQ ID 
NO: 16. 

In another embodiment, the polynucleotide further comprises a nucleic 

10 acid molecule encoding the asd amino acid sequence of SEQ ID NO:4; anucleic 

acid molecule encoding the dap A amino acid sequence of SEQ ID NO: 6; a 
nucleic acid molecule encoding the dapB amino acid sequence of SEQ ID NO: 8; 
a nucleic acid molecule encoding the ddh amino acid sequence of SEQ ID 
NO: 10; and a nucleic acidmolecule encoding the ORF2 amino acid sequence of 

15 SEQ ID NO: 16. 

In another embodiment, the polynucleotide further comprises a nucleic 
acid molecule encoding the asd amino acid sequence of SEQ ID NO:4; a nucleic 
acid molecule encoding the dapA amino acid sequence of SEQ ID NO:6; a 
nucleic acidmolecule encoding the dapB amino acid sequence of SEQ IDNO:8; 

20 a nucleic acid molecule encoding the ddh amino acid sequence of SEQ ID 

NO:10; anucleic acidmolecule encoding the 'lysA amino acid sequence of SEQ 
ID NO : 2 1 ; and a nucleic acid molecule encoding the ORF2 amino acid sequence 
of SEQ ID NO: 16. 

In another embodiment, the polynucleotide further comprises a nucleic 

25 acid molecule encoding the asd amino acid sequence of SEQ ID NO:4; a nucleic 

acid molecule encoding the dapA amino acid sequence of SEQ ID NO: 6; a 
nucleic acidmolecule encoding the dapB amino acidsequence of SEQ ID NO: 8; 
a nucleic acid molecule encoding the ddh amino acid sequence of SEQ ID 
NO: 1 0; a nucleic acid molecule encoding the lysA amino acid sequence of SEQ 

30 ID NO: 14; and anucleic acidmolecule encoding the ORF2 amino acid sequence 

of SEQ ID NO: 16. 



In one embodiment, the transformed host cell is a Brevibacterium selected 
from the group consisting of Brevibacterium flavum NRRL-B30218, 
Brevibacterium flavum NRRL-B30219, Brevibacterium lactofermentum 
~MRRL-B30220,Brevibacterium lactofermentum NRRL-B3022 1 , Brevibacterium 
lactofermentum NRRL-B30222, Brevibacterium flavum NRRL-30234 and 
Brevibacterium lactofermentum NRRL-30235. In another embodiment, the host 
cell is Escherichia coli DH5 a MCR NRRL-B30228. In another embodiment, 
the host cell is a C. glutamicum selected from the group consisting of C. 
glutamicum NRRL-B30236 and C. glutamicum NRRL-B30237. 

It is another object of the invention to provide a method of producing 
lysine comprising culturing the host cells comprising the amino acid sequence of 
SEQ ID NO: 2 wherein said host cells comprise one or more of (a) increased 
enzyme activity of one or more lysine biosynthetic pathway enzymes compared 
to the genetically unaltered nonhuman host cell; (b) one or more copies of each 
gene encoding a lysine biosynthetic pathway enzyme; and, (c) alteration of one 
or more transcription factors regulating transcription of one or more genes 
encoding a lysine biosynthetic pathway enzyme, wherein said host cell produces 
lysine in said culture medium. In one embodiment of the invention, the increased 
enzyme activity comprises overexpressing one or more genes encoding one or 
more lysine biosynthetic pathway enzymes. In another embodiment of the 
invention the increased enzyme activity results from the activity of one or more 
modified lysine biosynthetic pathway enzymes wherein said enzyme 
modification results in a change in kinetic parameters, allosteric regulation, or 
both, compared to the enzyme lacking the modification. In another embodiment 
of the invention, alteration of one or more transcription factors comprises one or 
more mutations in transcription inhibitor proteins, one or more mutations in 
transcription activator proteins, or both, wherein said one or more mutations 
increases transcription of the target nucleotide sequence compared to the 
transcription by said one or more transcription factors lacking said alteration(s) . 
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It is an object of the invention to provide an isolated polypeptide, wherein 
saidpolypeptide comprises an amino acidsequence having at least 95% sequence 
identity to the amino acid sequence of SEQ ID NO: 1 9. It is a further object of the 
invention to provide an isolated polypeptide comprising the amino acid sequence 
5 of SEQ ID NO: 19. It is a further object of the invention to provide an isolated 

polynucleotide comprising anucleic acid having the sequence of SEQ ID NO: 1 8 . 
It is another object of the invention to provide host cell NRRL B 3 03 60. 

It is an obj ect of the invention to provide an isolated polypeptide wherein 
said polypeptide comprises a polypeptide having at least 95% sequence identity 
10 to the amino acid sequence of SEQ ID NO:21. It is a further object of the 

invention to provide an isolated polypeptide comprising the amino acid sequence 
of SEQ ID NO:21. It is a further object of the invention to provide a 
polynucleotide molecule comprising anucleic acid having the sequence of SEQ 
ID NO: 20. 

15 It is an object of the invention to provide an isolated polynucleotide 

molecule comprising anucleotide sequence encoding the polypeptide comprising 
the amino acid sequence of SEQ ID NO:2, further comprising a promoter 
sequence where said promoter sequence has at least 95% sequence identity to 
SEQ ID NO: 17. It is a further object of the invention to provide an isolated 

20 polynucleotide molecule comprising a nucleotide sequence encoding the 

polypeptide comprising the amino acid sequence of SEQ ID NO:2, wherein the 
polynucleotide molecule further comprises the sequence of SEQ ID NO: 17. It 
is a further object of the invention to provide a host cell NRRL B30359. 

Further obj ects and advantages of the present invention will be clear from 

25 the description that follows. 
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Brief Description of the Figures 

Figure 1. A schematic of the L-lysine biosynthetic pathway in 
Corynebacterium glutamicum (Sahm et al.). 

Figure 2. The nucleotide sequence of as k(ATCC 2 1529 sequence) (SEQ 
5 IDNO:l). 

Figure 3 A, B. The amino acid sequence of ask (ATCC 2 1 529 sequence) 
(SEQIDNO:2). 

Figure 4. The nucleotide sequence of asd (ATCC 2 1529 sequence) (SEQ 
ID NO:3). 

1 0 Figure 5 A, B. The amino acid sequence of asd (ATCC 2 1 529 sequence) 

(SEQIDNO:4). 

Figure 6. The nucleotide sequence oidapA (NRRL-B1 1474) (SEQ ID 

NO:5 ). 

Figure 7. The amino acid sequence of dap A (NRRL-B1 1474) (SEQ ID 

15 NO:6). 

Figure 8. The nucleotide sequence of dapB (NRRL-B 1 1474) (SEQ ID 

NO:7). 

Figure 9 The amino acid sequence of dapB (NRRL-B 1 1474) (SEQ ID 

NO:8). 

20 Figure 10. The nucleotide sequence of ddh (NRRL-B 1 1474) (SEQ ID 

NO: 9). 

Figure 11 A, B. The amino acid sequence of ddh (NRRL-B 1 1474) (SEQ 
ID NO: 10). 

Figure 12. The nucleotide sequence of full length lysA (NRRL-B 1 1474) 
25 (SEQ ID NO: 1 1) used to obtain the truncated lysA {'lysA) nucleotide sequence. 

Underlined region annealed with lysA primer. 

Figure 13. The amino acid sequence of full length lysA (NRRL- 
B 11474) (SEQ ID NO: 12) comprising the truncated lysA ('lysA) amino acid 
sequence (SEQ ID NO: 21). Underlined L: the last amino acid residue of lysA 
30 encoded in the truncated PCR product. 
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Figure 14. The nucleotide sequence of full length lysA (pRS6) (SEQ ID 
NO: 13). 

Figure ISA, B, C. The amino acid sequence of full length lysA (pRS6) 
(SEQ ID NO: 14). 

5 Figure 16. The nucleotide sequence of ORF2 (NRRL-B 1 1 474) (SEQ 

ID NO: 15). 

Figure 1 7. The amino acid sequence of ORF2 (NRRL-B 1 1474) (SEQ 
ID NO: 16). 

Figure 18. A schematic depiction of the construction of the 5 and 6 
10 lysine pathway gene constructs of the invention. 

Figure 19. Comparison of the aspartokinase (ask) amino acid sequence 
fromATCC13032,N13 and ATCC21529. 

Figure 20. The nucleotide sequence of the Hpal-PvuII fragment from 
pRS6 (SEQ ID NO: 17) comprising the PI promoter. 
15 Figure 21 A, B. A schematic depiction of the construction of the 

pDElia2-KDABHPlL construct. 

Figure 22. A schematic depiction of the construction of the pDElia2 FC5 - 
KDBHL construct. 

Figure 23. The nucleotide sequence of truncated ORF2 (SEQ ID NO : 1 8) . 
20 Figure 24. The amino acid sequence of truncated ORF2 (SEQ ID 

NO: 19). 

Figure 25. The nucleotide sequence of truncated LysA ('lysA)(NRRL- 
B 11474) (SEQ ID NO:20). 

Figure 26. The amino acid sequence of truncated LysA ('LysA)(NRRL- 
25 B11474)(SEQIDNO:21). 
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Detailed Description of the Preferred Embodiments 
A. Definitions 

In order to provide a clear and consistent understanding of the 
specification and claims, including the scope to be given such terms, the 
5 following definitions are provided. It is also to be noted that the term "a" or "an" 

entity, refers to one or more of that entity; for example, "a polynucleotide," is 
understood to represent one or more polynucleotides. 

Allosteric Regulation. As used herein, the term refers to regulation of 
enzyme activity through the binding of one or more ligands (allosteric effectors) 

10 to one or more binding sites . The ligands may be the s ame molecule or different 

molecules. The molecules bind to sites on the enzyme other than the enzyme 
active site. As a result of the binding, a conformational change is induced in the 
enzyme which regulates affinity of the active site for its substrate or other 
ligands. Allosteric effectors may serve to enhance catalytic site substrate affinity 

15 (allosteric activators) or to reduce affinity (allosteric repressors). Allosteric 

effectors form the basis of metabolic control mechanisms such as feedback loops , 
for example (See, Copeland, Robert A., in Enzymes. A Practical Introduction to 
Structure, Mechanism, and Data Analysis, pages 279-296, Wiley-VCH, New 
York (1996)). 

20 Amino Acid Biosynthetic Pathway Genes. As used herein, the term 

"amino acid biosynthetic pathway gene(s)" is meant to include those genes and 
genes fragments encoding peptides, polypeptides, proteins, and enzymes, which 
are directly involved in the synthesis of amino acids. These genes may be 
identical to those which naturally occur within a host cell and are involved in the 

25 synthesis of any amino acid, and particularly lysine, within that host cell. 

Alternatively, there may be modifications or mutations of such genes, for 
example, the genes may contain modifications or mutations which do not 
significantly affect the biological activity of the encoded protein. For example, 
the natural gene may be modified by mutagenesis or by introducing or 
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substituting one or more nucleotides or by removing nonessential regions of the 
gene. Such modifications are readily performed by standard techniques. 

Auxotroph. As used herein, the term refers to a strain of microorganism 
requiring for growth an external source of a specific metabolite that cannot be 
synthesized because of an acquired genetic defect. 

Amino Acid Supplement. As used herein, the term refers to an amino 
acid required for growth and added to minimal media to support auxotroph 
growth. 

Chromosomal Integration. As used herein, the term refers to the 
insertion of an exogenous DNA fragment into the chromosome of a host 
organism; more particularly, the term is used to refer to homologous 
recombination between an exogenous DNA fragment and the appropriate region 
of the host cell chromosome. 

Enhancers. As used herein, the term refers to a DNA sequence which can 
stimulate promoter activity and may be an endogenous element or aheterologous 
element inserted to enhance the level, i.e., strength of a promoter. 

High Yield Derivative. As used herein, the term refers to strain of 
microorganism that produces a higher yield from dextrose of a specific amino 
acid when compared with the parental strain from which it is derived. 

Host Cell. As used herein, the term "host cell" is intended to be 
interchangeable with the term "microorganism." Where a difference is intended, 
the difference will be made clear. 

Isolated Nucleic Acid Molecule. As used herein, the term is intended to 
mean a nucleic acid molecule, DNA or RNA, which has been removed from its 
native environment. For example, recombinant DNA molecules contained in a 
vector are considered isolated for the purposes of the present invention. Further 
examples of isolated DNA molecules include recombinant DNA molecules 
maintained in heterologous host cells or purified (partially or substantially) DNA 
molecules in solution. Isolated RNA molecules include in vivo or in vitro RNA 
transcripts of the DNA molecules of the present invention. Isolated nucleic acid 
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molecules according to the present invention further include such molecules 
produced synthetically. 

Lysine Biosynthetic Pathway Protein. As used herein, the term "lysine 
biosynthetic pathway protein" is meant to include those peptides, polypeptides, 
5 proteins, and enzymes, which are directly involved in the synthesis of lysine from 

aspartate. Also included are amino acid sequences as encoded by open reading 
frames (ORF), where the ORF is associated with a lysine biosynthetic pathway 
operon. These proteins may be identical to those which naturally occur within 
a host cell and are involved in the synthesis of lysine within that host cell. 

10 Alternatively, there may be modifications or mutations of such proteins, for 

example, the proteins may contain modifications or mutations which do not 
significantly affect the biological activity of the protein. For example, the natural 
protein may be modified by mutagenesis or by introducing or substituting one or 
more amino acids, preferably by conservative amino acid substitution, or by 

15 removing nonessential regions of the protein. Such modifications are readily 

performed by standard techniques. Alternatively, lysine biosynthetic proteins 
may be heterologous to the particular host cell. Such proteins may be from any 
organism having genes encoding proteins having the same, or similar, 
biosynthetic roles. 

20 Mutagenesis. As used herein, the term refers to a process whereby a 

mutation is generated in DNA. With "random" mutagenesis, the exact site of 
mutation is not predictable, occurring anywhere in the genome of the 
microorganism, and the mutation is brought about as a result of physical damage 
caused by agents such as radiation or chemical treatment. rDNA mutagenesis is 

25 directed to a cloned DNA of interest, and it may be random or site-directed. 

Mutation. As used herein, the term refers to a one or more base pair 
change, insertion or deletion, or a combination thereof, in the nucleotide 
sequence of interest. 

Operably Linked. As used herein, the term "operably linked" refers to 

30 a linkage of polynucleotide elements in a functional relationship. A nucleic acid 

is "operably linked" when it is placed into a functional relationship with another 
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nucleic acid sequence. For instance, a promoter or enhancer is operably linked 
to a coding sequence if it affects the transcription of the coding sequence. 
Operably linked means that the DNA sequences being linked are typically 
contiguous and, where necessary, join two protein coding regions, contiguous 
5 and in reading frame. However, since enhancers generally function when 

separated from the promoter by several kilobases and intronic sequences may be 
of variable lengths, some polynucleotide elements may be operably linked but 
not contiguous. 

Operon. As used herein, the term refers to a contiguous portion of a 

10 transcriptional complex in which two or more open reading frames encoding 

polypeptides are transcribed as amulti-cistronic messenger RNA, controlled by 
a cis-acting promoter and other cis-acting sequences necessary for efficient 
transcription, as well as additional cis acting sequences important for efficient 
transcription and translation (e.g., mRNA stability controlling regions and 

15 transcription termination regions). The term generally also refers to a unit of 

gene expression and regulation, including the structural genes and regulatory 
elements in DNA. 

Parental Strain. As used herein, the term refers to a strain of host cell 
subjected to some form of treatment to yield the host cell of the invention. 

20 Percent Yield From Dextrose. As used herein, the term refers to the 

yield of amino acid from dextrose defined by the formula [(g amino acid 
produced/ g dextrose consumed)* 100] = % Yield. 

Phenotype. As used herein, the term refers to observable physical 
characteristics dependent upon the genetic constitution of a host cell. 

25 Promoter. As used herein, the term "promoter" has its art-recognized 

meaning, denoting a portion of a gene containing DNA sequences that provide 
for the binding of RNA polymerase and initiation of transcription and thus refers 
to a DNA sequence capable of controlling the expression of a coding sequence 
or functional RNA. Promoter sequences are commonly, but not always, found in 

30 the 5' non-coding regions of genes. In general, a coding sequence is located 3' to 

a promoter sequence. Sequence elements within promoters that function in the 
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initiation of transcription are often characterized by consensus nucleotide 
sequences. The promoter sequence consists of proximal and more distal 
upstream elements (enhancers). As usedherein, the term "endogenous promoter" 
refers to a promoter sequence which is a naturally occurring promoter sequence 
5 in that host microorganism. The term "heterologous promoter" refers to a 

promoter sequence which is a non-naturally occurring promoter sequence in that 
host microorganism. The heterologous occurring promoter s equence may be from 
any prokaryotic or eukaryotic organism. A synthetic promoter is a nucleotide 
sequence, having promoter activity, and not found naturally occurring in nature. 
10 Promoters may be derived in their entirety from a native gene, or be 

hybrid promoters. Hybrid promoters are composed of different elements derived 
from different promoters found in nature, or even comprise synthetic DNA 
segments. Hybrid promoters may be constitutive, inducible or environmentally 
responsive. 

15 Useful promoters include constitutive and inducible promoters. Many 

such promoter sequences are known in the art. See, for example, U.S. Pat. Nos. 
4,980,285; 5,631,150; 5,707,828; 5,759,828; 5,888,783; 5,919,670, and, 
Sambrook, et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold 
Spring Harbor Press (1989). Other useful promoters include promoters which are 

20 neither constitutive nor responsive to a specific (or known) inducer molecule. 

Such promoters may include those that respond to developmental cues (such as 
growth phase of the culture), or environmental cues (such as pH, osmoticum, 
heat, or cell density, for example). 

Examples of environmental conditions that may effect transcription by 

25 inducible promoters include anaerobic conditions, elevated temperature, or the 

presence of light. It is understood by those skilled in the art that different 
promoters may direct the expression of a gene in different cell types, or in 
response to different environmental conditions. Promoters which cause a gene to 
be expressed in most cell types at most times are commonly referred to as 

30 "constitutive promoters." It is further recognized that since in most cases the 

exact boundaries of regulatory sequences have not been completely defined, 
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DNA fragments of different lengths may have identical or similar promoter 
activity. 

Relative Growth. As used herein, the term refers to a measurement 
providing an assessment of growth by directly comparing growth of a parental 
strain with that of a progeny strain over a defined time period and with a defined 
medium. 

Transcription factor. As used herein, the term "transcription factor" 
refers to RNA polymerases, and other proteins that interact with DNA in a 
sequence-specific manner and exert transcriptional regulatory effects. 
Transcriptional factors may be transcription inhibitory proteins or transcription 
activator proteins. In the context of the present invention, binding sites for 
transcription factors (or transcription complexes) are often included in the 
transcriptional regulatory element(s). 

Transcription factor recognition site. As used herein, a "transcription 
factor recognition site" and a "transcription factor binding site" refer to a 
polynucleotide sequence(s) or sequence motif(s) which are identified as being 
sites for the sequence-specific interaction of one or more transcription factors, 
frequently taking the form of direct protein-DNA binding. Typically, 
transcription factor binding sites can be identified by DNA footprinting, gel 
mobility shift assays, and the like, and/or can be predicted on the basis of known 
consensus sequence motifs, or by other methods known to those of skill in the 
art. 

Transcriptional Complex. As used herein, the term "transcriptional 
unit" or "transcriptional complex" refers to a polynucleotide sequence that 
comprises a structural gene (one or more exons), a cis-acting linked promoter and 
one or more other cis-acting sequences necessary for efficient transcription of the 
structural sequences, distal regulatory elements necessary for appropriate 
transcription of the structural sequences, and additional cis sequences important 
for efficient transcription and translation (e.g., polyadenylation site, mRNA 
stability controlling sequences). See, for example U.S. Patent No. 6,057,299. 
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Transcriptional Regulatory Element. As used herein, the term 
"transcriptional regulatory element" refers to a DNA sequence which activates 
transcription alone or in combination with one or more other DNA sequences. A 
transcriptional regulatory element can, for example, comprise a promoter, 
5 response element, negative regulatory element, silencer element, gene 

suppressor, and/or enhancer. See, for example, U.S. Patent No. 6,057,299. 

B. Microbiological and Recombinant DNA Methodologies 

The invention as provided herein utilizes some methods and techniques 
that are known to those skilled in the arts of microbiology and recombinant DNA 

10 technologies. Methods and techniques for the growth of bacterial cells, the 

introduction of isolated DNA molecules into host cells, and the isolation, cloning 
and sequencing of isolated nucleic acid molecules, etc., are a few examples of 
such methods and techniques. These methods and techniques are described in 
many standard laboratory manuals, such as Davis et al, Basic Methods In 

1 5 Molecular Biology (1 986), J.H. Miller, Experiments in Molecular Genetics, Cold 

Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1972); J.H. 
Miller, A Short Course in Bacterial Genetics, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, New York (1992); M. Singer and P. Berg, Genes & 
Genomes, University Science Books, Mill Valley, California (1991); J. 

20 Sambrook, E.F. Fritsch and T. Maniatis, Molecular Cloning: A Laboratory 

Manual, 2d ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York ( 1 989); P.B . Kaufman et al. , Handbook of Molecular and Cellular Methods 
in Biology and Medicine, CRC Press, Boca Raton, Florida (1995); Methods in 
Plant Molecular Biology and Biotechnology, B.R. Glick and J.E. Thompson, 

25 eds., CRC Press, Boca Raton, Florida (1993); and P.F. Smith-Keary, Molecular 

Genetics of Escherichia coli, The Guilford Press, New York, NY (1989), all of 
which are incorporated herein by reference in their entireties. 

Unless otherwise indicated, all nucleotide sequences newly described 
herein were determined using an automated DNA sequencer (such as the Model 



-19- 



373 from Applied Biosysteras, Inc.)- Therefore, as is known in the art, for any 
DNA sequence determined by this automated approach, any nucleotide sequence 
determined herein may contain some errors. Nucleotide sequences determined 
by automation are typically at least about 90% identical, more typically at least 
about 95% to at least about 99.9% identical to the actual nucleotide sequence of 
the sequenced DNA molecule. The actual sequence can be more precisely 
determined by other approaches including manual DNA sequencing methods 
well known in the art. 

In certain embodiments, polynucleotides of the invention comprise a 
nucleic acid, the sequence of which is at least 90%, 9 1 %, 92%, 93%, 94%, 95%, 
96%, 97%, 98% or 99% identical to a sequence selected from the group 
consisting of SEQ ID NO: 17, SEQ ID NO: 18; and SEQ ID NO:20, or a 
complementary sequence thereof. 

By a polynucleotide comprising a nucleic acid, the sequence of which is 
at least, for example, 95% "identical" to a reference nucleotide sequence is 
intended that the nucleic acid sequence is identical to the reference sequence 
except that the nucleic acid sequence may include up to five mismatches per 
each 100 nucleotides of the reference nucleic acid sequence. In other words, to 
obtain a nucleic acid, the sequence of which is at least 95% identical to a 
reference nucleic acid sequence, up to 5% of the nucleotides in the reference 
sequence may be deleted or substituted with another nucleotide, or a number of 
nucleotides up to 5% of the total nucleotides in the reference sequence may be 
inserted into the reference sequence. The reference (query) sequence may be any 
one of the entire nucleotide sequences shown in SEQ ID NO: 17, SEQ ID NO: 1 8, 
or SEQ ID NO:20, or any fragment of any of these sequences, as described infra. 

As a practical matter, whether any particular nucleic acid sequence is at 
least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identical to, for 
instance, a nucleotide sequence consisting of SEQ ID NO: 17; SEQ ID NO.: 1 8, 
or SEQ ID NO:20, or a complementary sequence thereof, can be determined 
conventionally using sequence analysis computer programs such as a OMIGA® 
Version 2.0 for Windows, availablefrom Oxford Molecular, Ltd. (Oxford, U.K.). 
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OMIGA uses the CLUSTAL W alignment algorithm using the slow full dynamic 
programming alignment method with default parameters of an open gap penalty 
of 10 and an extend gap penalty of 5.0, to find the best alignment between two 
nucleotide sequences. When using CLUSTAL W or any other sequence 
5 alignment program to determine whether a particular sequence is, for instance, 

95% identical to a reference sequence according to the present invention, the 
parameters are set, of course, such that the percentage of identity is calculated 
over the full length of the reference nucleotide sequence such that gaps, 
mismatches, or insertions of up to 5% of the total number of nucleotides in the 
1 0 reference sequence are allowed. Other sequence analysis programs , known in the 

art, can be used in the practice of the invention. 

This embodiment of the present invention is directed to polynucleotides 
comprising a nucleic acid, the sequence of which is at least 90%, 91%, 92%, 
93%, 94%, 95%, 96%, 97%, 98% or 99% identical to anucleic acid sequence of 
15 SEQ ID NO: 17, SEQ ID NO: 18, and SEQ ID NO:20, or a complementary 

sequence thereof, irrespective of whether they have functional activity. This is 
because even where aparticularpolynucleotide does not have functional activity, 
one of skill in the art would still know how to use the nucleic acid molecule, for 
instance, as a hybridization probe, an SI nuclease mapping probe, or a 
20 polymerase chain reaction (PCR) primer. 

Preferred, however, are polynucleotides comprising a nucleic acid, the 
sequence of which is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% 
or 99% identical to a nucleic acid sequence of SEQ ID NO: 17, SEQ ID NO: 18 
or SEQ ID NO:20, or a complementary sequence thereof, which do, in fact, have 
25 functional activity in Corynebacterium species. 

By a polypeptide having an amino acid sequence at least, for example, 
95% "identical" to a reference amino acid sequence of a polypeptide is intended 
that the amino acid sequence of the claimed polypeptide is identical to the 
reference sequence except that the claimed polypeptide sequence may include up 
30 to five amino acid alterations per each 100 amino acids of the reference amino 

acid of the polypeptide. In other words, to obtain a polypeptide having an amino 
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acid sequence at least 95% identical to a reference amino acid sequence, up to 
5% of the amino acid residues in the reference sequence may be deleted or 
substituted with another amino acid, or a number of amino acids up to 5% of the 
total amino acid residues in the reference sequence may be inserted into the 
reference sequence. These alterations of the reference sequence may occur at the 
amino or carboxy terminal positions of the reference amino acid sequence or 
anywhere between those terminal positions, interspersed either individually 
among residues in the reference sequence or in one or more contiguous groups 
within the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 80%, 
85%, 90%o, 92%, 95%, 96%, 97%, 98% or 99% identical to, for instance, the 
amino acid sequence shown in SEQ ID NO:2 or to the amino acid sequence 
encoded by a nucleic acid sequence can be determined conventionally using 
known computer programs such the Bestfit program (Wisconsin Sequence 
Analysis Package, Version 8 for Unix, Genetics Computer Group, University 
Research Park, 575 Science Drive, Madison, WI 5371 1). When using Bestfit or 
any other sequence alignment program to determine whether a particular 
sequence is, for instance, 95% identical to a reference sequence according to the 
present invention, the parameters are set, of course, such that the percentage of 
identity is calculated over the full length of the reference amino acid sequence 
and that gaps in homology of up to 5% of the total number of amino acid 
residues in the reference sequence are allowed. 

In a specific embodiment, the identity between a reference sequence 
(query sequence, a sequence of the present invention) and a subject sequence, 
also referred to as a global sequence alignment, is determined using the FASTDB 
computer program based on the algorithm of Bmtlag et al. (Comp. App. Biosci. 
6:237-245 (1990)). Preferred parameters used in a FASTDB amino acid 
alignment are: Matrix=PAM 0, k-tuple=2, Mismatch Penalty=l, Joining 
Penalty=20, Randomization Group Length=0, Cutoff Score=l, Window 
Size=sequence length, Gap Penalty=5, Gap Size Penalty=0.05, Window 
Size=500 or the length of the subject amino acid sequence, whichever is shorter. 
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According to this embodiment, if the subject sequence is shorter than the query 
sequence due to N- or C-terminal deletions, not because of internal deletions, a 
manual correction is made to the results to take into consideration the fact that 
the FASTDB program does not account for N- and C-terminal truncations of the 
5 subj ect sequence when calculating global percent identity. For subj ect sequences 

truncated at the N- and C-termini, relative to the query sequence, the percent 
identity is corrected by calculating the number of residues of the query sequence 
that are N- and C-terminal of the subject sequence, which are not 
matched/aligned with a corresponding subject residue, as a percent of the total 

10 bases of the query sequence. A determination of whether a residue is 

matched/aligned is determined by results of the FASTDB sequence alignment. 
This percentage is then subtracted from the percent identity, calculated by the 
above FASTDB program using the specified parameters, to arrive at a final 
percent identity score. This final percent identity score is what is used for the 

15 purposes of this embodiment. Only residues to the N- and C-termini of the 

subject sequence, which are not matched/aligned with the query sequence, are 
considered for the purposes of manually adjusting the percent identity score. 
That is, only query residue positions outside the farthest N- and C-terminal 
residues of the subject sequence. For example, a 90 amino acid residue subject 

20 sequence is aligned with a 100 residue query sequence to determine percent 

identity. The deletion occurs at the N-terminus of the subject sequence and 
therefore, the FASTDB alignment does not show a matching/alignment of the 
first 10 residues at the N-terminus. The 10 unpaired residues represent 10% of 
the sequence (number of residues at the N- and C-termini not matched/total 

25 number of residues in the query sequence) so 1 0% is subtracted from the percent 

identity score calculated by the FASTDB program. If the remaining 90 residues 
were perfectly matched the final percent identity would be 90%. In another 
example, a 90 residue subject sequence is compared with a 100 residue query 
sequence. This time the deletions are internal deletions so there are no residues 

30 at the N- or C-termini of the subject sequence which are not match ed/aligned 

with the query In this case the percent identity calculated by FASTDB is not 
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manually corrected. Once again, only residue positions outside the N- and C- 
terminal ends of the subject sequence, as displayed in the FASTDB alignment, 
which are not matched/aligned with the query sequence are manually corrected 
for. No other manual corrections are made for the purposes of this embodiment. 

C. Methods and Processes of the Invention 

Various embodiments of the invention provide methods to increase the 
production of an amino acid and processes for the production of an amino acid 
from a Corynebacterium species host cell. Particularly preferred 
Corynebacterium species of the methods and processes of the invention include: 
Corynebacterium glutamicum, Brevibacterium flavum, Brevibacterium 
lactofermentum and other Cornynebacteria and Brevibacteria species known in 
the art. 

As will be understood by those skilled in the art, the term 
"Corynebacterium species" includes those organisms previously identified in the 
literature as "Brevibacterium species," for example Brevibacterium flavum and 
Brevibacterium lactofermentum which have now been reclassified into the genus 
Corynebacterium {Int. J. Syst. Bacteriol. 41: 255 (1981)). 

Amino acid biosynthetic pathway genes embodied by the methods and 
processes describedherein include those for L-glycine, L-alanine, L-methionine, 
L-phenylalanine, L-tryptophan, L-proline, L-serine, L-threonine, L-cysteine, 
L-tyrosine, L-asparagine, L-glutamine, L-aspartic acid, L-glutamic acid, L- 
lysine, L-arginine, L-histidine, L-isoleucine, L-leucine, and L-valine 
biosynthesis. Particularly preferred embodiments are drawn to biosynthetic 
pathway genes for L-lysine (Sahm et al, Ann. N. Y. Acad. Sci. 782: 25-39 
(1996)), L-threonine, L-isoleucine, L-tryptophan, and L-valine. 

By way of example, the amino acid pathway for L-lysine biosynthesis 
is well known to skilled artisans of amino acid production in Corynebacterium 
species. Genes encoding the enzymes important for the conversion of L- 
aspartate to L-lysine include the ask, asd, dap A, dapB, ddh and lysA genes 
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(Figure 1). Thus, the invention provides herein for exemplary purposes only, 
specific embodiments utilizing L-lysine biosynthetic pathway genes. Other 
embodiments drawn to the use of biosynthetic pathway genes for the synthesis 
of other amino acids are also encompassed by the invention described herein. 

The methods to increase the production of an amino acid and the 
processes for the production of an amino acid of the invention both utilize a step 
requiring the transformation of an isolated nucleic acid molecule into a 
Corynebacterium species host cell. As known to one skilled in the art, 
transformation of an isolated nucleic acid molecule into a host cell may be 
effected by electroporation, transduction or other methods. These methods are 
described in the many standard laboratory manuals referenced and incorporated 
herein. 

The methods to increase the production of an amino acid and the 
processes for the production of an amino acid of the invention both utilize a step 
requiring amplification of at least one amino acid biosynthesis pathway gene. As 
known to one skilled in the art, the term amplification means increasing the 
number of a gene or genes of an amino acid biosynthetic pathway by any means 
known in the art. Particularly preferred means of amplification include: (1) the 
addition an isolated nucleic acid molecule comprising copies of a gene or genes 
of a biosynthetic pathway by insertion into the chromosome of a host cell, for 
example by homologous recombination, and (2) the addition an isolated nucleic 
acid molecule comprising copies of a gene or genes of a biosynthetic pathway 
into a host cell by way of a self-replicating, extra-chromosomal vector, for 
example, a plasmid. 

Another method of the invention to increase the production of an amino 
acid comprises increasing the expression of at least one amino acid biosynthetic 
pathway gene. Preferred methods of increasing expression comprise using 
heterologous promoters, regulated promoters, unregulated promoters and 
combinations thereof. 

Methods of inserting an isolated nucleic acid molecule into the 
chromosome of a host cell are known to those skilled in the art. For example, 
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insertion of isolated nucleic acid molecules into the chromosome of 
Corynebacterium species may be done utilizing the pKl 84 plasmid described by 
Jobling, M. et ah, Nucleic Acids Research 18(17): 5315-5316 (submitted 1990). 
Because these vectors lack a Corynebacterium species origin of replication and 
5 contain a selectable marker such as kanamycin (fain), cells will only be capable 

of growing under selection if the vector has been inserted into the host cell 
chromosome by homologous recombination. 

In alternative embodiments, the invention also provides methods for 
increasing amino acid production and processes for the production of an amino 

10 acid wherein biosynthetic pathway gene amplification is accomplished through 

the introduction into a host cell of a self-replicating, extra-chromosomal vector, 
e.g. , a plasmid, comprising an isolated nucleic acid molecule encoding an amino 
acid biosynthetic pathway gene or genes. Suitable plasmids for these 
embodiments include pSRl and other derivatives of pSRl (Archer, J. et al, J. 

15 Gen. Microbiol. 139: 1753-1759 (1993)). 

For various embodiments of the invention drawn to a method to increase 
production of an amino acid, screening for increased production of an amino 
acid, for example L-ly sine, may be determined by directly comparing the amount 
of L-lysine produced in culture by a Corynebacterium species host strain to that 

20 of a Corynebacterium species transformed host strain in which an amino acid 

biosynthesis gene or genes are amplified. The level of production of the amino 
acid of choice may conveniently be determined by the following formula to 
calculate the percent yield from dextrose: [(g amino acid/L / (g dextrose 
consumed/L) ] *100. 

25 In one embodiment, the invention provides a method to increase the 

production of an amino acid comprising: (a) transforming a Corynebacterium 
species host cell with an isolated polynucleotide molecule comprising a 
nucleotide sequence encoding a polypeptide comprising the amino acid sequence 
of SEQ ID NO:2; (b) amplifying the number of at least one of the biosynthetic 

30 pathway genes for said amino acid in the chromosome of said host cell; 
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(c) selecting a transformed host cell; and (d) screening for increased production 
of said amino acid from said transformed host cell relative to said host cell. 

In a particularly preferred embodiment, the invention provides a method 
to increase the production of an amino acid comprising transforming a 
5 Corynebacterium species host cell with an isolated polynucleotide molecule 

comprising a nucleotide sequence encoding a polypeptide comprising the amino 
acid sequence of SEQ ID NO:2; and further comprising at least one of the 
following: a nucleic acid molecule encoding a Corynebacterium species lysine 
pathway asd amino acid sequence; a nucleic acid molecule encoding a 

10 Corynebacterium species lysine pathway dap A amino acid sequence; a nucleic 

acid molecule encoding a Corynebacterium species lysine pathway dapB amino 
acid sequence; a nucleic acid molecule encoding a Corynebacterium species 
lysine pathway ddh amino acid sequence; a nucleic acid molecule encoding a 
Corynebacterium species lysine pathway 'lysA amino acid sequence; a nucleic 

15 acid molecule encoding a Corynebacterium species lysine pathway lysA amino 

acid sequence; and a nucleic acid molecule encoding a Corynebacterium species 
lysine pathway ORF2 amino acid sequence. 

In another particular embodiment of the method, the isolated 
polynucleotide molecule further comprises at least one of the following: a 

20 nucleic acid molecule encoding the asd amino acid sequence of SEQ ID NO:4; 

a nucleic acid molecule encoding the dapA amino acid sequence of SEQ ID 
NO: 6; a nucleic acid molecule encoding the dapB amino acid sequence of SEQ 
IDNO:8; anucleic acid molecule encoding the ddh amino acid sequence of SEQ 
ID NO: 10, a nucleic acid molecule encoding the 'lysA amino acid sequence of 

25 SEQ ID NO:2 1 ; a nucleic acid molecule encoding the lysA amino acid sequence 

of SEQ ID NO: 14; and a nucleic acid molecule encoding the ORF2 amino acid 
sequence of SEQ ID NO: 16. 

In another particular embodiment of the method, the isolated 
polynucleotide molecule further comprises the following: anucleic acidmolecule 

30 encoding the as d amino acid sequence of SEQ ID NO: 4; anucleic acidmolecule 

encoding the dapA amino acid sequence of SEQ ID NO:6; a nucleic acid 
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molecule encoding the dapB amino acid sequence of SEQ ID NO: 8; and a 
nucleic acid molecule encoding the ORF2 amino acid sequence of SEQ ID 
NO: 16. 

In another particular embodiment of the method, the isolated 
5 polynucleotide molecule further comprises the following: anucleic acidmolecule 

encoding the asd amino acid sequence of SEQ ID NO: 4; anucleic acidmolecule 
encoding the dapA amino acid sequence of SEQ ID NO:6; a nucleic acid 
molecule encoding the dapB amino acid sequence of SEQ ID NO:8; a nucleic 
acid molecule encoding the ddh amino acid sequence of SEQ ID NO: 1 0; and a 
10 nucleic acid molecule encoding the ORF2 amino acid sequence of SEQ ID 

NO: 16. 

In another particular embodiment of the method, the isolated 
polynucleotide molecule further comprises the following: anucleic acid molecule 
encoding the asd amino acid sequence of SEQ ID NO:4, anucleic acid molecule 

15 encoding the dap A amino acid sequence of SEQ ID NO: 6; a nucleic acid 

molecule encoding the dapB amino acid sequence of SEQ ID NO: 8; a nucleic 
acid molecule encoding the ddh amino acid sequence of SEQ ID NO: 10; a 
nucleic acid molecule encoding the 'lysA amino acid sequence of SEQ ID 
NO:21; and a nucleic acid molecule encoding the ORF2 amino acid sequence of 

20 SEQ ID NO: 16. 

In another particular embodiment of the method, the polynucleotide 
molecule further comprises the following: a nucleic acid molecule encoding the 
asd amino acid sequence of SEQ IDNO:4; anucleic acidmolecule encoding the 
dapA amino acid sequence of SEQ ID NO: 6; a nucleic acid molecule encoding 

25 the dapB amino acid sequence of SEQ ID NO:8; a nucleic acid molecule 

encoding the ddh amino acid sequence of SEQ ID NO: 10; a nucleic acid 
molecule encoding the lysA amino acid sequence of SEQ ID NO: 14; and a 
nucleic acid molecule encoding the ORF2 amino acid sequence of SEQ ID 
NO:16. 
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In another embodiment of the method, the method further comprises 
growing said transformed host cell in a medium; and purifying an amino acid 
produced by said transformed host cell. 

It is another object of the invention to provide an isolated polynucleotide 
5 molecule comprising the polynucleotide molecule comprising a nucleotide 

sequence encoding the polypeptide comprising the amino acid sequence of SEQ 
ID NO. 2; and at least one additional Corynebacterium species lysine pathway 
gene selected from the group consisting of a nucleic acid molecule encoding an 
^^polypeptide; anucleic acidmolecule encoding a dapA polypeptide; anucleic 
10 acid molecule encoding a dap B polypeptide; a nucleic acid molecule encoding 

a ddh polypeptide; a nucleic acid molecule encoding a 'lysA polypeptide; a 
nucleic acid molecule encoding a lysA polypeptide; and a nucleic acid molecule 
encoding an ORF2 polypeptide. In apreferred embodiment, said asc/polypeptide 
is SEQ IDNO:4; said dapA polypeptide is SEQ IDNO:6; said dapB polypeptide 
1 5 is SEQ ID NO: 8; said ddh polypeptide is SEQ ID NO: 1 0; said 'lysA polypeptide 

is SEQ ID NO: 21; said lysA polypeptide is SEQ ID NO: 14; and said ORF2 
polypeptide is SEQ ID NO: 16. 

It is another object of the invention to provide an isolated polynucleotide 
molecule comprising the polynucleotide molecule comprising a nucleotide 
20 sequence encoding the polypeptide comprising the amino acid sequence of SEQ 

ID NO 2; a nucleic acid molecule encoding the asd amino acid sequence of SEQ 
ID NO:4; a nucleic acid molecule encoding the dapA amino acid sequence of 
SEQ ID NO: 6; anucleic acidmolecule encoding the dapB amino acid sequence 
of SEQ ID NO: 8; and a nucleic acid molecule encoding the ORF2 amino acid 
25 sequence of SEQ ID NO: 1 6. 

It is another object of the invention to provide an isolated polynucleotide 
molecule comprising the polynucleotide molecule comprising a nucleotide 
sequence encoding the polypeptide comprising the amino acid sequence of SEQ 
ID NO: 2; a nucleic acid molecule encoding the asd amino acid sequence of 
30 SEQ ID NO:4; a nucleic acid molecule encoding the dapA amino acid sequence 

of SEQ ID NO: 6; a nucleic acid molecule encoding the dapB amino acid 
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sequence of SEQ ID NO: 8; a nucleic acid molecule encoding the ddh amino 
acid sequence of SEQ ID NO: 10; and a nucleic acid molecule encoding the 
ORF2 amino acid sequence of SEQ ID NO: 16. 

It is another object of the invention to provide an isolated polynucleotide 
molecule comprising the polynucleotide molecule comprising a nucleotide 
sequence encoding the polypeptide comprising the amino acid sequence of SEQ 
ID NO:2; anucleic acidmolecule encoding the asd amino acid sequence of SEQ 
ID NO: 4; a nucleic acid molecule encoding the dapA amino acid sequence of 
SEQ ID NO: 6; a nucleic acid molecule encoding the dapB amino acid sequence 
of SEQ ID NO: 8; anucleic acidmolecule encoding the ddh amino acid sequence 
of SEQ ID NO: 10; a nucleic acid molecule encoding the 'lysA amino acid 
sequence of SEQ ID NO:21; and a nucleic acid molecule encoding the ORF2 
amino acid sequence of SEQ ID NO: 16. 

It is another object of the invention to provide an isolated polynucleotide 
molecule comprising the polynucleotide molecule comprising a nucleotide 
sequence encoding the polypeptide comprising the amino acid sequence of SEQ 
IDNO:2; anucleic acidmolecule encoding the asd amino acid sequence of SEQ 
ID NO:4; a nucleic acid molecule encoding the dapA amino acid sequence of 
SEQ ID NO: 6; a nucleic acid molecule encoding the dapB amino acid sequence 
of SEQIDNO:8; anucleic acidmolecule encoding the ddh amino acidsequence 
of SEQ ID NO: 10; a nucleic acid molecule encoding the lysA amino acid 
sequence of SEQ ID NO: 14; and a nucleic acid molecule encoding the ORF2 
amino acid sequence of SEQ ID NO: 16. 

It is a further object of the invention to provide an isolated polynucleotide 
molecule comprising pK184-KDAB. It is a further object of the invention to 
provide an isolated polynucleotide molecule comprising pKl 84-KDABH'L. It 
is a further object of the invention to provide an isolatedpolynucleotide molecule 
comprising pDl l-KDABH'L. It is a further object of the invention to provide 
an isolated polynucleotide molecule comprising pD2-KDABHL. 

It is a further object of the invention to provide a vector comprising the 
isolated polynucleotide molecule comprising a nucleotide sequence encoding a 



-30- 



polypeptide comprising the amino acid sequence of SEQ ID NO 2; and further 
comprising at least one additional Corynebacterium species lysine pathway gene 
selected from the group consisting of a nucleic acid molecule encoding an asd 
polypeptide; anucleic acidmolecule encoding a dapA polypeptide; anucleic acid 
5 molecule encoding a dapB polypeptide; a nucleic acid molecule encoding a ddh 

polypeptide; a nucleic acid molecule encoding a 'lysA polypeptide; a nucleic 
acidmolecule encoding a lysA polypeptide; and anucleic acidmolecule encoding 
an ORF2 polypeptide. 

It is a further object to provide a host cell comprising a vector 

10 comprising the isolated polynucleotide molecule comprising a nucleotide 

sequence encoding apolypeptide comprisingthe amino acidsequence of SEQ ED 
NO 2; and further comprising at least one additional Corynebacterium species 
lysine pathway gene selected from the group consisting of a nucleic acid 
molecule encoding an asd polypeptide; anucleic acidmolecule encoding a dapA 

15 polypeptide; anucleic acidmolecule encoding a dapB polypeptide; anucleic acid 

molecule encoding a ddh polypeptide; a nucleic acid molecule encoding a 'lysA 
polypeptide; a nucleic acid molecule encoding a lysA polypeptide; and anucleic 
acid molecule encoding an ORF2 polypeptide. 

It is a further object to provide a host cell wherein said host cell is a 

20 Brevibacterium selected from the group consisting of Brevibacterium flavum 

NRRL-B30218, Brevibacterium flavum NRRL-B30219, Brevibacterium 
lactofermentum NRRL-B30220, Brevibacterium lactofermentum NRRL-B3 022 1 , 
Brevibacterium lactofermentum NRRL-B30222, Brevibacterium flavum 
NRRL-30234 and Brevibacterium lactofermentum NRRL-30235. In another 

25 embodiment, the host cell is Escherichia coli DH5 a MCR NRRL-B30228. In 

another embodiment, the host cell is a C. glutamicum selected from the group 
consisting of C. glutamicum NRRL-B30236 and C. glutamicum NRRL-B30237. 

The invention provides processes for the production of an amino acid. 
In one embodiment, the invention provides a process for producing an amino 

30 acid comprising: (a) transforming a Corynebacterium species host cell with an 

isolated nucleic acid molecule; (b) amplifying the number of chromosomal 
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copies of at least one of the biosynthetic pathway genes for said amino acid; (c) 
selecting a transformed host cell; (d) growing said transformed cell in amedium; 
and (e) purifying said amino acid. 

The invention is also directed to an isolated polypeptide comprising the 
5 amino acid sequence of SEQ ID NO: 1 9. In one embodiment of the invention, the 

polypeptide has at least 95% sequence identity to the amino acid sequence of 
SEQ ID NO: 19. The invention is also directed to an isolated polynucleotide 
molecule comprising anucleotide sequence encoding the polypeptide of SEQ ID 
NO: 19. In one embodiment, the isolated polynucleotide comprises anucleic acid 

1 0 having the s equence of S EQ ID NO : 1 8 . 

The invention is also directed to a vector comprising the polynucleotide 
molecule comprising anucleotide sequence encoding the polypeptide comprising 
the amino acid sequence of SEQ ED NO: 19. In one embodiment, the invention 
is directed to a host cell comprising a vector encoding a polypeptide comprising 

1 5 the amino acid sequence of SEQ ID NO: 1 9. In one embodiment, the host cell is 

NRRL B30360. 

The invention is also directed to a method comprising transforming a 
Corynebacterium species host cell with the polynucleotide molecule comprising 
a nucleotide sequence encoding a polypeptide comprising the amino acid 

20 sequence of SEQ ID NO: 19, and selecting a transformed host cell. In one 

embodiment, the method further comprises screening for increased amino acid 
production. In a preferred embodiment, the amino acid screened for is lysine. In 
one embodiment, the polynucleotide molecule is integrated into said host cell's 
chromosome, thereby increasing the total number of said amino acid biosynthetic 

25 pathway genes in said host cell chromosome. 

In another embodiment, the polynucleotide molecule further comprises 
at least one of the following: (a) a nucleic acid molecule encoding a 
Corynebacterium species lysine pathway ask amino acid sequence; (b) a nucleic 
acid molecule encoding a Corynebacterium species lysine pathway asd amino 

30 acid sequence; (c) a nucleic acid molecule encoding a Corynebacterium species 

lysine pathway dap A amino acid sequence; (d) anucleic acid molecule encoding 
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a Corynebacterium species lysine pathway dapB amino acid sequence; (e) a 
nucleic acid molecule encoding a Corynebacterium species lysine pathway ddh 
amino acid sequence; (f) a nucleic acid molecule encoding a Corynebacterium 
species lysine pathway 'lysA amino acid sequence; (g) a nucleic acid molecule 
encoding a Corynebacterium species lysine pathway lysA amino acid sequence; 
and, (h) a nucleic acid molecule encoding an ORF2 polypeptide having SEQ ID 
NO: 16. In this embodiment, the method further comprises screening for 
increased amino acid production. In another embodiment, the amino acid 
screened for is lysine. 

In another embodiment of the method, the polynucleotide molecule 
further comprises: (a) a nucleic acid molecule encoding the ask amino acid 
sequence having SEQ ID NO:2; (b) a nucleic acid molecule encoding a 
Corynebacterium species lysine pathway as d amino acid sequence; (c) a nucleic 
acid molecule encoding a Corynebacterium species lysine pathway dapB amino 
acid sequence; (d) a nucleic acid molecule encoding a Corynebacterium species 
lysine pathway ddh amino acid sequence; and, (e) a nucleic acid molecule 
encoding a Corynebacterium species lysine pathway lysA amino acid sequence. 
In one embodiment of this method, the method further comprises screening for 
increased amino acid production. 

The invention is also directed to an isolated polypeptide comprising the 
amino acid sequence of SEQ IDNO:21 . In one embodiment, the polypeptide has 
at least 95% sequence identity to the amino acid sequence of SEQ ID NO:21 . 
The invention also comprises an isolated polynucleotide molecule comprising a 
nucleotide sequence encoding the polypeptide comprising the amino acid 
sequence having at least 95% sequence identity to the amino acid sequence of 
SEQ ID N0.21 . The invention is further comprises a polynucleotide molecule 
comprising a nucleic acid having the sequence of SEQ ID NO: 20. In one 
embodiment the invention comprises a vector comprising the polynucleotide 
molecule comprising a nucleotide sequence encoding the polypeptide comprising 
the amino acid sequence having at least 95% sequence identity to the amino acid 
sequence of SEQ ID NO:21. The invention further comprises a host cell 
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comprising the vector comprising the polynucleotide molecule comprising a 
nucleotide sequence encoding the polypeptide comprising the amino acid 
sequence having at least 95% sequence identity to the amino acid sequence of 
SEQ IDNO:21. 

In one embodiment, the invention comprises a host cell selected from 
the group consisting of NRRL B30218, NRRL B30220 and NRRL B3 0222. 

The invention is further directed to a method comprising transforming a 
Corynebacterium species host cell with a polynucleotide molecule comprising a 
nucleotide sequence encoding the polypeptide comprising the amino acid 
sequence having at least 95% sequence identity to the amino acid sequence of 
SEQ ID NO: 21, and selecting a transformed host cell. The method further 
comprises screening for increased amino acid production; in particular, for lysine 
production. In one embodiment, the polynucleotide molecule is integrated into 
said host cell's chromosome, thereby increasing the total number of said amino 
acid biosynthetic pathway genes in said host cell chromosome. In one 
embodiment the method further comprises a polynucleotide molecule further 
comprising at least one of the following: (a) a nucleic acid molecule encoding a 
Corynebacterium species lysine pathway ask amino acid sequence; (b) a nucleic 
acid molecule encoding a Corynebacterium species lysine pathway ask amino 
acid sequence having SEQ ID NO. 2; (c) a nucleic acid molecule encoding a 
Corynebacterium species lysine pathway asd amino acid sequence; (d) a nucleic 
acid molecule encoding a Corynebacterium species lysine pathway dapA amino 
acid sequence; (e) a nucleic acid molecule encoding a Corynebacterium species 
lysine pathway dapB amino acid sequence; (f) a nucleic acid molecule encoding 
a Corynebacterium species lysine pathway ddh amino acid sequence; (g) a 
nucleic acidmolecule encoding a Corynebacterium species lysine pathway ORF2 
amino acid sequence; and, (h) a nucleic acid molecule encoding a truncated 
Corynebacterium species lysine pathway ORF2 amino acid sequence. In one 
embodiment, the method further comprises screening for increased amino acid 
production. In another embodiment, the amino acid screened for is lysine. 



-34- 



Another embodiment of the invention is also directed to an isolated 
polynucleotide molecule comprising a nucleotide sequence encoding the 
polypeptide comprising the amino acid sequence of SEQ ID NO: 2 , wherein the 
polynucleotide molecule further comprises a promoter sequence having SEQ ID 
NO: 17. In one embodiment, the promoter sequence has at least 95% sequence 
identity to SEQ ID NO: 17. In one embodiment, the promoter sequence having 
at least 95% sequence identity to SEQ ID NO: 17 is operably directly linked to 
the LysA gene. In another embodiment of the invention, there is a vector 
comprising the isolated polynucleotide molecule comprising a nucleotide 
sequence encoding the polypeptide comprising the amino acid sequence of SEQ 
ID NO:2, wherein the polynucleotide molecule further comprises a promoter 
sequence wherein said promoter sequence has at least 95% sequence identity to 
SEQ ID NO: 17. In another aspect of the invention, there is a host cell 
comprising the vector comprising the isolated polynucleotide molecule 
comprising a nucleotide sequence encoding the polypeptide comprising the 
amino acid sequence of SEQ ID NO:2, wherein the polynucleotide molecule 
further comprises a promoter sequence having at least 95% sequence identity to 
SEQ ID NO: 17. In one embodiment, the host cell is NRRL B30359. 

The invention is also directed to a method comprising transforming a 
Corynebacterium species host cell with the polynucleotide molecule comprising 
a nucleotide sequence encoding the polypeptide comprising the amino acid 
sequence of SEQ ID NO:2, wherein the polynucleotide molecule further 
comprises a promoter sequence having at least 95% sequence identity to SEQ ID 
NO: 17, and selecting a transformed host cell. In one embodiment, the method 
further comprises screening for increased amino acid production. In another 
embodiment, the amino acid screened for is lysine. In another embodiment of 
the method, the polynucleotide molecule is integrated into said host cell's 
chromosome, thereby increasing the total number of amino acid biosynthetic 
pathway genes in said host cell chromosome. In another embodiment of the 
method, the polynucleotide molecule further comprises at least one of the 
following: (a) a nucleic acid molecule encoding a Corynebacterium species 
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lysine pathway asd amino acid sequence; (b) a nucleic acid molecule encoding 
a Corynebacterium species lysine pathway dapA amino acid sequence; (c) a 
nucleic acid molecule encoding a Corynebacterium species lysine pathway dapB 
amino acid sequence; (d) a nucleic acid molecule encoding a Corynebacterium 
species lysine pathway ddh amino acid sequence; (e) a nucleic acid molecule 
encoding a Corynebacterium species lysine pathway Oi?F2 amino acid sequence; 
(f) a nucleic acid molecule encoding a truncated Corynebacterium species lysine 
pathway amino acid sequence; (g) a nucleic acid molecule encoding a 

Corynebacterium species lysine pathway (y^ amino acid sequence; and, (h) a 
nucleic acid molecule encoding a truncated Corynebacterium species lysine 
pathway lysA amino acid sequence. In this embodiment, the method further 
comprises screening for increased amino acid production; in particular, for lysine 
production. 

In a different embodiment of the method, the polynucleotide molecule 
comprises: (a) a nucleic acid molecule encoding a Corynebacterium species 
lysine pathway asd amino acid sequence; (b) a nucleic acid molecule encoding 
a Corynebacterium species lysine pathway dap A amino acid sequence; (c) a 
nucleic acid molecule encoding a Corynebacterium species lysine pathway dapB 
amino acid sequence; (d) a nucleic acid molecule encoding a Corynebacterium 
species lysine pathway ddh amino acid sequence; (e) a nucleic acid molecule 
encoding a Corynebacterium species lysine pathway ORF2 amino acid sequence; 
and, (f) a nucleic acid molecule encoding a Corynebacterium species lysine 
pathway lysA amino acid sequence. In this embodiment, the method further 
comprises screening for increased amino acid production. In a preferred 
embodiment, the amino acid is lysine. 

A variety of media known to those skilled in the art may be used to 
support cell growth for the production of an amino acid. Illustrative examples 
of suitable carbon sources include, but are not limited to: carbohydrates, such as 
glucose, fructose, sucrose, starch hydrolysate, cellulose hydrolysate and 
molasses; organic acids, such as acetic acid, propionic acid, formic acid, malic 
acid, citric acid, and fumaric acid; and alcohols, such as glycerol. Illustrative 
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examples of suitable nitrogen sources include, but are not limited to: ammonia, 
including ammonia gas and aqueous ammonia; ammonium salts of inorganic or 
organic acids, such as ammonium chloride, ammonium phosphate, ammonium 
sulfate and ammonium acetate; and other nitrogen-containing sources, including 
5 meat extract, peptone, corn steep liquor, casein hydrolysate, soybean cake 

hydrolysate, urea and yeast extract. 

A variety of fermentation techniques are known in the art which may be 
employed in processes of the invention drawn to the production of amino acids. 
Generally, amino acids may be commercially produced from the invention in 

1 0 fermentation processes such as the batch type or of the fed-batch type. In batch 

type fermentations, all nutrients are added at the beginning of the fermentation. 
In fed-batch or extended fed-batch type fermentations one or a number of 
nutrients are continuously supplied to the culture, right from the beginning of the 
fermentation or after the culture has reached a certain age, or when the nutrient(s) 

1 5 which are fed were exhausted from the culture fluid. A variant of the extended 

batch of fed-batch type fermentation is the repeated fed-batch or fill-and-draw 
fermentation, where part of the contents of the fermenter is removed at some 
time, for instance when the fermenter is full, while feeding of a nutrient is 
continued. In this way a fermentation can be extended for a longer time. 

20 Another type of fermentation, the continuous fermentation or chemostat 

culture, uses continuous feeding of a complete medium, while culture fluid is 
continuously or semi-continuously withdrawn in such a way that the volume of 
the broth in the fermenter remains approximately constant. A continuous 
fermentation can in principle be maintained for an infinite time. 

25 In a batch fermentation an organism grows until one of the essential 

nutrients in the medium becomes exhausted, or until fermentation conditions 
become unfavorable (e.g., the pH decreases to a value inhibitory for microbial 
growth). In fed-batch fermentations measures are normally taken to maintain 
favorable growth conditions, e.g., by using pH control, and exhaustion of one or 

30 more essential nutrients is prevented by feeding these nutrient(s) to the culture. 

The microorganism will continue to grow, at a growth rate dictated by the rate 
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of nutrient feed. Generally a single nutrient, very often the carbon source, will 
become limiting for growth. The same principle applies for a continuous 
fermentation, usually one nutrient in the medium feed is limiting, all other 
nutrients are in excess. The limiting nutrient will be present in the culture fluid 
5 at a very low concentration, often unmeasurably low. Different types of nutrient 

limitation can be employed. Carbon source limitation is most often used. Other 
examples are limitation by the nitrogen source, limitation by oxygen, limitation 
by a specific nutrient such as a vitamin or an amino acid (in case the 
microorganism is auxotrophic for such a compound), limitation by sulphur and 

1 0 limitation by phosphorous . 

The amino acid may be recovered by any method known in the art. 
Exemplary procedures are provided in the following: Van Walsem, H.J. & 
Thompson, M.C., J. Biotechnol. 59: 127-132 (1997), and U.S. Pat. No. 
3,565,951, both of which are incorporated herein by reference. 

1 5 The invention described herein provides isolated nucleic acid molecules 

comprising at least one L-lysine amino acid biosynthesis gene. Unless otherwise 
indicated, all nucleotide sequences described herein were determined using an 
automated DNA sequencer (such as the Model 373 from Applied Biosystems, 
Inc.), and all amino acid sequences of polypeptides encoded by DNA molecules 

20 described herein were predicted by translation of the relative DNA sequence. 

Therefore, as is known in the art, for any DNA sequence determined by this 
automated approach, any nucleotide sequence determined herein may contain 
some errors. Nucleotide sequences determined by automation are typically at 
least about 90% identical, more typically at least about 95% to at least about 

25 99.9% identical to the actual nucleotide sequence of the sequenced DNA 

molecule. The actual sequence can be more precisely determined by other 
approaches including manual DNA sequencing methods well known in the art. 

As is also known in the art, a single insertion or deletion in a determined 
nucleotide sequence compared to the actual sequence will cause a frame shift in 

30 translation of the nucleotide sequence such that the predicted amino acid 

sequence encoded by a determined nucleotide sequence will be completely 
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different from the amino acid sequence actually encoded by the sequenced DNA 
molecule, beginning at the point of such an insertion or deletion. 

The invention provides several isolated nucleic acid molecules encoding 
comprising at least one L-lysine amino acid biosynthesis pathway gene of 
Corynebacterium glutamicum. More specifically, the invention provides the 
following isolated nucleic acid molecules: the nucleotide sequence of the ask 
gene from the strain ATCC 21529 (SEQ ID NO: 1); the nucleotide sequence of 
the asd gene from the strain ATCC 21529 (SEQ ID NO:3); the nucleotide 
sequence of the dapA gene from the strain NRRL-B 1 1474 (SEQ ID NO:5 ); the 
nucleotide sequence of the dapB gene from the strain NRRL-B 1 1474 (SEQ ID 
NO: 7); the nucleotide sequence of the ddh gene from the strain NRRL-B 1 1474 
(SEQ ID NO: 9) and the nucleotide sequence of the ORF2 gene from the strain 
NRRL-B 11474 (SEQ ID NO: 15). In addition, also provided herein is the 
nucleotide sequence of lysA (SEQ ID NO: 13) gene fromplasmidpRS6 (Marcel, 
T., et al, Molecular Microbiology 4: 1819-1830 (1990)). 

It is known in the art that amino acids are encoded at the nucleic acid 
level by one or more codons (code degeneracy). It is also known in the art that 
choice of codons may influence expression of a particular amino acid sequence 
(protein, polypeptide, etc.). Thus, the invention is further directed to nucleic acid 
molecules encoding the ask amino acid sequence of SEQ ID NO:2 wherein the 
nucleic acid molecule comprises any codon known to encode a particular amino 
acid. The invention is also further directed to nucleic acid sequences (SEQ ID 
NOs:l, 3, 5, 7, 9, 11, 13, 15, 18 and 20) which comprise alternative codons in 
order to optimize expression of the protein or polypeptide. 

In addition to the above described isolated nucleic acid molecules, the 
invention also provides isolatednucleic acidmolecules comprising more than one 
L-lysine Corynebacterium glutamicum biosynthesis gene. Such isolatednucleic 
acidmolecules are referred to as "cassette" constructs. These cassette constructs 
simplify for the practitioner the number of recombinant DNA manipulations 
required to achieve gene amplification of L-lysine biosynthesis genes. 
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In one embodiment drawn to a cassette construct, the invention provides 
an isolated nucleic acid molecule comprising: (a) a polynucleotide encoding the 
Corynebacteriumglutamicum L-lysine pathway askdxamo acidsequence of SEQ 
ID NO: 2; and (b) at least one additional Corynebacterium species L-lysine 
5 pathway gene selected from the group consisting of: (1) a polynucleotide 

encoding the asd polypeptide; (2) a polynucleotide encoding the dapA 
polypeptide; (3) a polynucleotide encoding the dapB polypeptide; (4) a 
polynucleotide encoding the ddh polypeptide; (5) a polynucleotide encoding the 
'lysA polypeptide, and (6) a polynucleotide encoding the ORF2 polypeptide. 

10 The isolated nucleic acid molecules of the invention are preferably 

propagated and maintained in an appropriate nucleic acid vector. Methods for 
the isolation and cloning of the isolated nucleic acid molecules of the invention 
are well known to those skilled in the art of recombinant DNA technology. 
Appropriate vectors and methods for use with prokaryotic and eukaryotic hosts 

15 are described by Sambrook et ah, Molecular Cloning: A Laboratory Manual, 

Second Edition, Cold Spring Harbor, N.Y., 1989, the disclosure of which is 
hereby incorporated by reference. 

A great variety of vectors can be used in the invention. Such vectors 
include chromosomal, episomal and virus-derived vectors, e.g., vectors derived 

20 from bacterial plasmids and from bacteriophage, as well as vectors derived from 

combinations thereof, such as those derived from plasmid and bacteriophage 
genetic elements, such as cosmids andphagemids, all may be used in accordance 
with this aspect of the present invention. Generally, any vector suitable to 
maintain and propagate a polynucleotide in a bacterial host may be used in this 

25 regard. 

A large numbers of suitable vectors and promoters for use in bacteria are 
known, many of which are commercially available. Preferred prokaryotic 
vectors include plasmids such as those capable of replication in E. coli (such as, 
for example, pBR322, ColEl, pSClOl, pACYC 184, tiVX). Such plasmids are, 
30 for example, disclosed by Maniatis, T., et ah, In: Molecular Cloning, A 

Laboratory Manual, Cold Spring Harbor Press , Cold Spring Harbor, NY ( 1 982)) . 
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The following vectors are provided by way of example: pET (Novagen), pQE70, 
pQE60, pQE-9 (Qiagen), pBs, phagescript, psiX174, pBlueScript SK, pBsKS, 
pNH8a, pNH16a, pNH18a, pNH46a (Stratagene), pTrc99A, pKK223-3, 
pKK233-3, pDR540, pRIT5 (Pharmacia). 
5 Preferred vectors for the isolated nucleic acid molecules of the invention 

include the pFCl to pFC7 novel family of combinatorial cloning vectors 
(Lonsdale, D.M., et al, Plant Molecular Biology Reporter 13: 343-345 (1995)), 
the pKl 84 vector (Jobling, M.G. and Homes, R.K., Nucleic Acid Research 18: 
5315-5316(1990)). 

10 Another group of preferred vectors are those that are capable of 

autonomous replication in Corynebacterium species. Such vectors are well 
known to those skilled in the art of amino acid production by way of microbial 
fermentation, examples of which include pSR 1 , pMF 1014a and vectors derived 
therefrom. 

15 The invention provides an isolated amino acid sequence of the ask 

polypeptide of the strain ATCC 21 529 (SEQ ID NO:2). The isolated ask amino 
sequence disclosed herein possesses unique properties with respect to feedback 
resistance of ask enzyme activity to accumulated levels of L-lysine and 
L-threonine in the culture medium. When compared to the DNA sequences of 

20 other Corynebacterium glutamicum ask-asd gene sequences, the invention 

discloses a threonine to isoleucine change at amino acid residue 380 which 
results in resistance to feedback inhibition. The invention also includes other 
amino acid changes at residue 380 which result in decreased ask enzyme 
sensitivity to L-threonine and/or L-lysine. 

25 In addition, and as described in more detail herein, the vector may 

contain control regions that regulate as well as engender expression. Generally, 
such regions will operate by controlling transcription, such as inducer or 
repressor binding sites and enhancers, among others. 

Vectors of the present invention generally will include a selectable 

30 marker. Such markers also may be suitable for amplification or the vectors may 

contain additional markers for this purpose. In this regard, vectors preferably 
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contain one or more selectable marker genes to provide a phenotypic trait for 
selection of transformed host cells. Such markers include, but are not limited to, 
an antibiotic resistance gene such as a chloramphenicol, ampicillin, or kanamycin 
resistance gene, or an autotrophic gene which allows the host cell to grow in the 
5 absence of a nutrient for which the host cell strain is normally auxotrophic. 

If the vector is intended to be maintained in the host cell 
extrachromosomally, it will contain, in addition and origin of replication which 
will allow it to replicate in the Corynebacterium species host cell. Alternatively, 
if it is desired that the vector integrate into the Corynebacterium species 

10 chromosome, the vector is constructed such that it cannot replicate in 

Corynebacterium. For example, such a vector might be capable of propagation 
in another organism, for example, E. coli, but lack the proper origin of 
replication to be propagated in Corynebacterium. In another aspect of this 
embodiment, the vector is a shuttle vector which can replicate and be maintained 

1 5 in more than one host cell species, for example, such a shuttle vector might be 

capable of replication in a Corynebacterium host cell such as a C. glutamicum 
host cell, and also in an E. coli host cell. 

The invention further provides the following isolated the amino acid 
sequences: the amino acid sequence of the asd polypeptide of the strain ATCC 

20 21 529 (SEQ ID NO:4); the amino acid sequence of the dapA polypeptide of the 

strain NRRL-B 11474 (SEQ ID NO: 6); the amino acid sequence of the dapB 
polypeptide of the strain NRRL-B11474 (SEQ ID NO:8); the amino acid 
sequence of the ddh polypeptide of the strain NRRL-B 1 1474 (SEQ ID NO: 10) 
and the amino acid sequence of the ORF2 polypeptide of the strain NRRL- 

25 Bl 1474 (SEQ ID NO: 16). In addition, also provided herein is the amino acid 

sequence of lysA (pRS6) (Marcel, T., et al, Mol Microbiol. 4: 819-830 (1990)) 
(SEQ ID NO: 14). 

In addition to the isolated polypeptide sequences defined by the specific 
sequence disclosures disclosed above, the invention also provides the amino acid 
30 sequences encoded by the deposited clones. 
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It will be recognized in the art that some amino acid sequences of the 
invention can be varied without significant effect of the structure or function of 
the proteins disclosed herein. Variants included may constitute deletions, 
insertions, inversions, repeats, and type substitutions so long as enzyme activity 
5 is not significantly affected. Guidance concerning which amino acid changes are 

likely to be phenotypically silent can be found in Bowie, J.U., et al, 
"Deciphering the Message in Protein Sequences: Tolerance to Amino Acid 
Substitutions," Science 247; 1306-13 10 (1990). 

The strains of the invention may be prepared by any of the methods and 
10 techniques known and available to those skilled in the art. Introduction of gene 

constructs of the invention into the host cell can be effected by electroporation, 
transduction or other methods. These methods are described in the many 
standard laboratory manuals referenced and incorporated herein. 

Various embodiments of the invention provide strains with increased 
15 L-lysine production as a result of gene amplification. By gene amplification is 

meant increasing the number of copies above the normal single copy number of 
an L-lysine biosynthesis pathway gene by a factor of 2, 3, 4, 5, 10, or more 
copies. 

In one embodiment of the invention, the additional copies of the L-lysine 
20 biosynthesis pathway gene(s) may be integrated into the chromosome. Another 

embodiment of the invention provides that the additional copies of the L-lysine 
biosynthesis pathway gene(s) are carried extra-chromosomally. Amplifications 
by a factor of 5 or less may be obtained by introducing the additional gene copies 
into the chromosome of the host strain by way of single event homologous 
25 recombination. In a most preferred embodiment, the recombination event results 

in the introduction of one additional copy of the copy of the gene or genes of 
interest. If more than 5 copies of the genes are desired, then the invention also 
provides for the use of multicopy plasmids carrying the recombinant DNA 
construct of the invention. 
30 Representative examples of appropriate hosts for isolated nucleic acid 

molecules of the invention include, but are not limited to, bacterial cells, such as 
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C. glutamicum, Escherichia coli, Streptomyces and Salmonella typhimurium 
cells; and fungal cells, such as yeast cells. Appropriate culture media and 
conditions for the above-described host cells are known in the art. 

Particularly preferred host cells of the invention include: 
5 Corynebacterium glutamicum, Brevibacterium flavum and Brevibacterium 

lactofermentum. 

Applicants have deposited clones carrying the pKl 84-KDABH'L multi- 
gene constructs at an acceptable International Depositary Authority in accordance 
with the Budapest Treaty on the International Recognition of the Deposit of 

10 Microorganisms for the Purposes of Patent Procedure. The deposits have been 

made with the Agricultural Research Service, Culture Collection (NRRL), 1815 
North University Street, Peoria, Illinois 61604. Deposits made in which the 
pKl 84-KDAB or pKl 84-KDABH'L multi-gene constructs have been integrated 
into the chromosome of a host cell include the following: (1) the pKl 84-KDAB 

15 plasmid, integrated into the chromosome, deposited as NRRL-B30219 and 

NRRL -B30221 and (2) the pKl 84-KDABH'L plasmid, integrated into the 
chromosome, deposited as NRRL-B3 02 1 8, NRRL-B30220, andNRRL-B30222. 
In addition, the pKl 84-KDABH'L multigene construct in a plasmid 
configuration, carried in E. coli DH5 a MCR, was deposited as NRRL-B30228. 

20 The six gene construct (pDElia2-KDABHL) was deposited in E. coli (NRRL- 

B30233). C. glutamicum comprising pKl 84-KDABH'L was deposited as 
NRRL-B30236. C. glutamicum comprising pKl 84-KDAB HL was deposited as 
NRRL-B30237. Brevibacterium flavum comprising pDElia2-KDABHL was 
deposited as NRRL-B30234. Brevibacterium lactofermentum comprising 

25 pDElia2-KDABHL was deposited as NRRL-B30235. 

It is an object of the invention to provide a method of producing lysine 
comprising culturing the host cells comprising the amino acid sequence of SEQ 
ID NO:2 wherein said host cells comprise one or more of: (a) increased enzyme 
activity of one or more lysine biosynthetic pathway enzymes compared to the 

30 genetically unaltered host cell; (b) one or more copies of each gene encoding a 

lysine biosynthetic pathway enzyme; and, (c) alteration of one or more 
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transcription factors regulating transcription of one or more genes encoding a 
lysine biosynthetic pathway enzyme, wherein said host cell produces lysine in 
said culture medium. In one embodiment of the method, said increased enzyme 
activity comprises overexpressing one or more genes encoding one or more 
lysine biosynthetic pathway enzymes. In one embodiment of the method, said 
one or more genes are operably linked directly or indirectly to one or more 
promoter sequences . In another embodiment of the method, said operably linked 
promoter 

sequences are heterologous, endogenous, or hybrid. In a preferred embodiment 
of the method, said promoter sequences are one or more of: a promoter sequence 
from the 5' end of genes endogenous to C. glutamicum, a promoter sequence 
from plasmids that replicate in C. glutamicum, and, a promoter sequence from 
the genome of phage which infect C. glutamicum. In a preferred embodiment of 
the method, one or more of said promoter sequences are modified. In another 
preferred embodiment, said modification comprises truncation at the 5' end, 
truncation at the 3' end, non-terminal insertion of one or more nucleotides, non- 
terminal deletion of one or more nucleotides , addition of one or more nucleotides 
at the 5' end, addition of one or more nucleotides at the 3' end, and, combinations 
thereof. 

In another embodiment of the method, said increased enzyme activity 
results from the activity of one or more modified lysine biosynthetic pathway 
enzymes wherein said enzyme 'modification results in a change in kinetic 
parameters, allosteric regulation, or both, compared to the enzyme lacking the 
modification. In one embodiment of the method, said change in kinetic 
parameters is a change in K^, V max or both. In another embodiment of the 
method, said change in allosteric regulation is a change in one or more enzyme 
allosteric regulatory sites. In one embodiment, said change in allosteric 
regulation is a change in the affinity of one or more enzyme allosteric regulatory 
sites for the ligand or ligands. The ligands may be the same or different. In one 
embodiment, said enzyme modification is a result of a change in the nucleotide 
sequence encoding said enzyme. In one embodiment, said change in said 
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nucleotide sequence is an addition, insertion, deletion, substitution, or a 
combination thereof, of one or more nucleotides. 

In another embodiment of the method, said alteration of one or more 
transcription factors comprises one or more mutations in transcription inhibitor 
proteins, one or more mutations in transcription activator proteins, or both, 
wherein said one or more mutations increases transcription of the target 
nucleotide sequence compared to the transcription by said one or more 
transcription factors lacking said alteration. In one embodiment, said one or more 
mutations is a change in said nucleotide sequence encoding said transcription 
factor. In another embodiment, said change in said nucleotide sequence is an 
addition, insertion, deletion, substitution, or a combination thereof, of one or 
more nucleotides. 

All patents and publications referred to herein are expressly incorporated 
by reference in their entirety. 

Examples 
Example 1 

Preparation of L-Lysine Pathway Multi-gene Constructs 
pK184-KDAB and pK184-KDABH'L 

Applicants have created L-lysine amino acid biosynthetic pathway multi- 
gene constructs for the purpose of amplifying the number of one or more of the 
genes of this pathway in the chromosome of Corynebacterium species. Also, 
through careful study of the L-lysine biosynthesis genes of strain ATCC 21529, 
Applicants have identified an amino acid change of threonine to isoleucine at 
amino acid residue 380 of the ask gene of ATCC 2 1 529. Compared to the DNA 
sequences of other Corynebacterium glutamicum ask genes, a threonine to 
isoleucine change at amino acid residue 380 was observed (Figure 1 9), which is 
responsible for the unusual feedback resistant properties with respect to aspartate 
kinase enzyme regulation. 
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The isolated nucleic acid molecules encoding L-lysine, amino acid 
biosynthesis pathway genes utilized in the present invention are from the 
following sources: 

Gene(s) Source 

ask-asd Strain ATCC 2 1 529; 

dapA Strain NRRL Bl 1474; 

dapB Strain NRRL B 1 1 474 ; 

ddh Strain NRRL Bl 1474; 

lysA Plasmid pRS6 (Marcel, T., et al. , Mol. Microbiol 4:819- 

830 (1990)) carrying the lysA gene isolated from strain 
AS019, which was derived from ATCC 13059; 

lysA NRRL B 11474; 

lysA NRRL B 1 1 474 (full length); and, 

ORF2 Strain NRRL B 1 1 474. 

As one skilled in the art would know, the invention is not limited to the 
specific strain origins that Applicants present for the isolated nucleic acid 
molecules of the invention. Any strain of Corynebacterium species, particularly 
that of Corynebacterium glutamicum, may be utilized for the isolation of nucleic 
acid molecules that will be used to amplify the number of chromosomally located 
amino acid biosynthetic pathway genes. Particularly preferred strains include: 
NRRL-B1 1474, ATCC 21799, ATCC 21529, ATCC 21543, and El 2. 

Methods and techniques common to the art of recombinant DNA 
technology were used in making the multi-gene constructs of the invention, as 
may be found in the many laboratory manuals cited and incorporated herein,for 
example as found in J. Sambrook, E.F. Fritsch and T. Maniatis, Molecular 
Cloning: A Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, New York (1989). 

The polymerase chain reaction (PCR) technique is used extensively in the 
making of the multi-gene constructs of the invention. In a typical reaction, the 
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standard 1 OX stock solution ( 1 00 mM Tris-HCL, pH 8.3, 500 mM KCL, 1 .5 mM 
MgCl 2 ) is diluted to IX for use. Typical reaction conditions were used for PCR 
amplication: 10 mM Tris, pH 8.3 , 50 mM KC1, 1.5 mM MgCl 2 , 0.01% gelatin, 
200 fiM deoxynucleotides, 0.2-1.0 jjM primers and2.5U/100//l pfu polymerase. 
5 Standard cycling parameters were also employed in PCR reactions: For 30 

cycles , template denaturation was performed at 94 ° C for 1 vain; 5 5 ° C annealing 
temperature was performed for 1 min (or annealing temperature appropriate for 
particular primer pair); product extension was performed at 72 °C for 1 min (if 
product is <500 bp), 3 min (if product is >500 bp); and at the end of cycling, a 
1 0 final extension at 72 ° C for 7 min was performed. 

The primers utilized for cloning experiments included: 
ask: 5-GGGTACCTCGCGAAGTAGCACCTGTCAC-3'; 
asd: 5'-GCGGATCCCCCATCGCCCCTCAAAGA-3'; 
dapB: 5'-AACGGGC GGTGA AGGGC AACT-3 ' ; 
15 dapA: 5'-TGAAAGACAGGGGTATCCAGA-3'; 

ddh 5'-CC ATGGTACC AAGTGCGTGGCGAG-3 

5'-CCATGGTACCACACTGTTTCCTTGC-3'; 
argS: 5-CTGGTTCCGGCGAGTGGAGCCGACCATTCCGCGAGG-3'; and 
lysA: 5-CTCGCTCCGGCGAGGTCGGAGGCAACTTCTGCGACG-3', a 
20 primer that anneals internally to lysA (about 5 OObp upstream to the end of lysA) . 

'LysA is a truncated form obtained from lysA. 

Applicants utilized standard PCR and subcloning procedures in cloning 
the coding regions of ask-asd, dapB-ORF2-dapA, ddh, 'lysA, and lysA. 
Construction procedures and intermediate plasmids are described in Figure 18. 
25 Applicants performed the following steps (Figure 18) in constructing the 

following vectors used in the L-lysine biosynthetic pathway: 

1. pGEMT -ask-asd: an approximately 2.6 Kb PCR product 
containing the ask-asd operon of ATCC21529 using primers ask and asd was 
cloned into pGEM-T (Promega pGEM-T vector systems); 
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2. pADM21: an approximately 1.3Kb PCR product (with an 
engineered Kpnl site on both primers) of NRRL-B 1 1 474 ddh coding region was 
cloned into pADM20; 

3. pUC 1 S-ddh: an approximately 1.3Kb Kpnl fragment of 
5 pADM21 containing ddh (NRRL-B 1 1474) was subcloned into pUC 18 at the 

Kpnl site; 

4. pLIC 1 J-SLrgS-lysA: PCR product using template NRRL-B 1 1474 
genomic DNA and primers argS and lysA was cloned into pPMG-LIC cloning 
vector (PharMingen); 

10 5. pM4-dapB-ORF2-dapA.: an approximately 3 Kb PCR product 

using primers dapB and dapA was cloned into pM4 at the Xbal site; 

6. pFCi-ask-asd: an approximately 2.6 Kb Nsil-Apal fragment of 
pGEMT-ask-asd was cloned into pFC3 cut with PstI and Apal; 

7. pFCl-ddh: -1.3 Kb Sall-EcoRI fragment of pUClS-ddh was 
15 cloned into pFCl cut with Sail and EcoRI; 

8. pFCl-ddh-lysA: an approximately 1.5 Kb EcoRI fragment 
(containing the truncated lysA DNA) of pLICl. 7 -argS-lys A was cloned into 
pFCl -ddh at the EcoRI site; 

9. p¥C5-dapB-ORF2-dapA: an approximately 3.4 Kb BamHI-BgUI 
20 fragment of pM4-dapB-ORF2-dapA was cloned into pFC5 at the BamHI site; 

10. pFC5-dapB-ORF2-dapA-ddh-lysA: -2.8 Kb Nhel fragment of 
pFC\-ddh-'lysA was cloned into pFC5-dapB-ORF2-dapA at the Nhel site; 

11. pFC-3-ask-asd-dapB-ORF2-dapA-ddh-'lysA: -6.2 Kb NotI 
fragment of pFC5-dapB-ORF2-dapA-ddh-lysA was cloned into pFC3-as k-asd 

25 at the NotI site; 

12. pT)FA\2L9-ask-asd-dapB-ORF2-dapA-ddh- , lysA (pDElia9- 
KDABH'L): -8.8 Kb Pmel fragment of pFC3-ask-asd-dapB-ORF2-dapA-ddh- 
'lysA was cloned into pDElia9 at the EcoRV site; and 

13. pKl S4-ask-asd-dapB-ORF2-dapA-ddh-'lysA (pKl 84-KDABH'L): 
30 an approximately 8.8 Kb Pmel fragment of pFC3-ask-asd-dapB-ORF2-dapA- 

ddh-'lysA was cloned into pKl 84 at the HincII or Smal site. 



-49- 



14. pFC5-ask-asd-dapB-ORF2-dapA (pFC5-KDAB): -2.6 KbKpnl- 
Smal fragment of pFC3-ask-asd was cloned into pFC5-dapB-ORF2-dapA cut 
with Kpnl and Smal. 

15. pK\S4-ask-asd-dapB-ORF2-dapA (pK184-KDAB): ~7KbKpnI- 
5 Pmel fragment of pFC5-ask-asd-dapB-ORF2-dapA was cloned into pK 1 84 cut 

with Kpnl and Hindi. 

Thus, Applicants have made the following L-lysine multi-gene constructs: 

1 . pKl 84-KDABH'L, wherein "K" represents anucleotide sequence 
encoding the ^^polypeptide; "D" represents anucleotide sequence encoding the 

10 asd polypeptide; "A" represents a nucleotide sequence encoding the dapA 

polypeptide; "B" represents a nucleotide sequence encoding the dapB 
polypeptide; "H"' represents a nucleotide sequence encoding the ddh 
polypeptide; and"'L" represents anucleotide sequence encoding part of the 'lysA 
polypeptide. This construct is referred to as a truncated 6 gene construct. The 

15 pK184-KDABHL construct, constructed infra, is referred to as a full length 6 

gene construct. 

2. pK184-KDAB, wherein "K" represents a nucleotide sequence 
encoding the as A: polypeptide; "D" represents anucleotide sequence encoding the 
asd polypeptide; "A" represents a nucleotide sequence encoding the dapA 

20 polypeptide; and "B" represents a nucleotide sequence encoding the dapB 

polypeptide. This construct is referred to as a 4 gene construct. 

Both pKl 84-KDABH'L and pK184-KDAB, as do the other constructs 
discussed herein, comprise the nucleotide sequence encoding the ORF2 
polypeptide. 

25 It should be noted that in addition to the indicated polypeptide sequences 

encoded by the isolated nucleic acid sequences represented by "K", "D", "A", 
"B," "H," "L" and"'L", these isolated nucleic acid sequences also include native 
promoter elements for the operons represented therein. Thus, the ask-asd 
sequences have been cloned in a fashion that includes the respective native 

30 promoter elements; the dapA and dapB sequences, representing the operon dapB- 

ORF2-dapA, have been cloned in a fashion that includes the respective promoter 
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elements; the ddh sequence has been cloned in a fashion that includes the 
respective native promoter elements, and the lysA and 'lysA sequences have been 
cloned in a fashion that includes a native promoter element. 

Alternative gene promoter elements may be utilized in the constructs of 
5 the invention. For example, known bacterial promoters suitable for this use in 

the present invention include the E. coli lad and lacZ promoters, the T3 and 77 
promoters, the gpt promoter, the lambda PR and PL promoters, the trp promoter, 
or promoters endogenous to the bacterial cells of the present invention. Other 
promoters useful in the invention include regulated promoters, unregulated 
10 promoters and heterologous promoters. Many such promoters are known to one 

of skill in the art. See Sambrook, E.F. et ah, Molecular Cloning: A Laboratory 
Manual, 2d ed. , Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York (1989). 



Example 2 

Two-Fold Amplification of L-lysine Amino Acid Biosynthesis 
Pathway Genes 



For exemplary purposes only, Applicants provide herein an example 
wherein at least one L-lysine amino acid biosynthesis pathway gene is amplified 
by a factor of 2 by way of (a) the introduction of an isolated nucleic acid 
20 molecule into a Corynebacterium glutamicum host cell, and (b) the subsequent 

single crossover homologous recombination event introducing said isolated 
nucleic acid molecule into said Corynebacterium glutamicum host cell 
chromosome. 

As will be understood by those in the art, at least one or two or three or 
25 four or five or six or seven or eight or nine or ten or more amino acid 

biosynthesis pathway genes may be amplified, i.e., increased in number, by a 
factor of at least one or two or three or four or five or six or seven or eight or 
nine or ten fold with minor variations of the example presented herein. 
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pK184-KDAB, pKl 84-KDABH'L and pD2-KDABHL(a full length 6 
gene construct constructed in Example 4) plasmids were used in the construction 
of high yield derivative cell lines of the invention. This was accomplished by 
way of introducing plasmid pK184-KDAB, pKl 84-KDABH'L and pD2- 

5 KDABHL DNAs into a Corynebacterium species resulting in incorporation of 

pK184-KDAB, pKl 84-KDABH'L or pD2-KDABHL into the host cell 
chromosome via a single crossover homologous recombination event. 
Amplification of the amino acid biosynthetic pathway genes by way of 
chromosomal integration of the plasmid constructs of the invention provided 

10 increased L-lysine production in several Corynebacterium species strains. 

For cell transformation experiments with the isolated nucleic acid 
molecules of the invention, the growth and preparation of competent cells may 
be done according to the following procedure: (1) picking a fresh, single colony 
of Corynebacterium glutamicum and growing a culture overnight in 10 mL CM 

15 (SMI) in a 250 mL shake flask at 30 degrees Celsius with agitation; 

(2) inoculating 200 mL of "Growth Media" with the overnight culture to an 
optical density (O.D.) of 660 nm of 0. 1 in a 500 mL shake flask; (3) growing the 
culture at 30 degrees Celsius with agitation for 5-6 hours; (4) pouring the culture 
into a chilled, sealed, sterile 250 mL centrifuge bottle; Spin at 8-1 OK for ten 

20 minutes in Refrigerated Sorvall at 4 degrees Celsius; (5) pouring off the 

supernatant thoroughly and resuspending the cell pellet in an equal volume of 
ice-cold, sterile, deionized water; (6) centrifuging the sample again under the 
same conditions; (7) repeating the water wash remembering to keep everything 
ice-cold; (8) pouring off the supernatant thoroughly and resuspending the cell 

25 pellet in 1 mL of ice-cold, sterile 10% glycerol and transferring the cells to a 

chilled, sterile, 1 .5 mL microcentrifuge tube; (9) spin the sample for 10 minutes 
in arefrigerated centrifuge; (10) pipetting off and discarding the supernatant, and 
resuspending the pellet in two to three times the pellet volume (200-400 uL) of 
10% glycerol; and (1 1) alliquoting, if necessary, the cells into chilled tubes and 

30 freezing at -70 Celsius. 
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pK184-KDAB, pKl 84-KDABH'L and pD2-KDABHL plasmid DNAs 
were introduced into Corynebacterium glutamicum host cells by the following 
electroporation procedure: (1) pipetting 3 5 uL cell/glycerol solution onto the side 
wall of a chilled 0 . 1 cm electrocuvette; (2) pipetting about 2-4 uL of plasmi d into 
the solution and mixing the sample by gentle pipetting up and down; (3) bringing 
the entire solution to the bottom of the electrocuvette by gentle tapping, avoiding 
the creation of bubbles; (4) keeping the sample on ice until ready for the 
electroshock step, wiping off any moisture on the outside of the electrocuvette 
prior to the electroshock administration, and shocking the cells one time at 
1.5kV, 200n,25uF. 

Cells are allowed to recover from electroporation by: (1) immediately 
pipetting 1 mL of warm "Recovery Media" into the electrocuvette and 
thoroughly mixing the solution by pipetting; (2) incubating the solution (in the 
electrocuvette) at 30 degrees Celsius for at least three hours for antibiotic 
resistance expression and cell recovery and (3) plating on selection media and 
incubating at 30 degrees Celsius for 3 days. 

Example 3 

Screening and Selection of Strains with Improved L-Lysine Production 

After 3 days of growth, single colonies of antibiotic resistant cells are 
individually selected to determine if there is increased L-lysine production over 
that which is produced by the parental host cell strain. 

Recipes for all media used in these experiments are found in Tables 1 
and 2. L-lysine production is determined on cultures of transformed, antibiotic 
resistant cells grown in shaker flasks. Briefly, seed media (Table 1), was 
dispensed in 20ml aliquots into deep baffled 250ml Bellco shake flasks and 
autoclaved for 20 minutes. After cooling to room temperature, these seed flasks 
were then inoculated with the strain to be tested and placed on a rotary shaker. 
They were incubated at 30 degrees Celsius, shaking, overnight. The following 
morning, the optical density (wavelength = 660nm) of each seed was recorded, 



-53- 

and 2ml of the culture from each seed flask was transferred to a 2 1 ml aliquot of 
FM3 media, also in a deep baffled shake flask. These "main" flasks were then 
returned to the shaker and incubated at 30 degrees Celsius. 

After 48 hours of incubation, 1 ml of main culture was removed from 
each flask, and the flasks were promptly returned to the shaker. From the 1 ml 
sample, optical density was determined by diluting 1:50 in 0. IN HC1 to dissolve 
the calcium carbonate present in the media. The remainder of each sample was 
then centrifuged to pellet cells and calcium carbonate. A 1:50 dilution of the 
supernatant was made in water and from this dilution the dextrose concentration 
was determined. Extracellular L-lysine concentrations were also determined at 
this time by HPLC. 

High yield derivative cells may be conveniently identified by determining 
the percent yield from dextrose, i.e., the yield of amino acid from dextrose 
defined by the formula [(g amino acid produced/ g dextrose consumed)* 100] = 
% yield. Results are presented below in which the parental strains E12, NRRL- 
Bl 1474 and ATCC 21799 are transformed with the L-lysine multi-gene isolated 
nucleic acid molecules of the invention identified as pK184-KDA, 
pKl 84-KDABH'L andpD(Elia)2-KDABHL. The pD2-KDABHL construct was 
made as in Example 4. 

lysine titer L-lysine yield Cell Deposit 



Strain Tested 


(g/L) 


(%) 




NRRL-B 11474 


31 


44 




NRRL-B 1 1 474 : :pKl 84-KDAB 


32 


45.7 


NRRL-B-30219 


NRRL-B 1 1 474 : :pKl 84-KD ABH' L 


36 


51.8 


NRRL-B-30218 


NRRL-B 1 1474: :pDElia2-KDABHL 


38 


54.6 


NRRL-B-30234 


E12 


1.4 


0.9 




E12::pKl 84-KDABH'L 


26.8 


38 


NRRL-B-30236 


E 1 2 : :pDElia2-KD ABHL 


29.8 


42.5 


NRRL-B-30237 


ATCC21799 


26.8 


36.9 




ATCC21799:: pK184-KDAB 


28.5 


39 


NRRL-B-30221 


ATCC21799:: pKl 84-KDABH'L 


31 


43 


NRRL-B-30220 


ATCC21799:: P DElia2-KDABHL 


36 


50 


NRRL-B-30235 
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Once high yield derivative cell lines are identified, the cell lines are 
further screened to determine that amplification of the amino acid biosynthetic 
pathway genes has occurred. Amplification screening may be conveniently 
accomplished either by (1) standard southern blot methodology to determine 

5 gene copy number or (2) by a determination of the total enzyme activity for 

enzymes encoded by the respective biosynthetic pathway genes of the isolated 
nucleic acid molecule introduced into the host cell. 

A determination of gene copy number by Southern blot methodology may 
be done utilizing standard procedures known in the art of recombinant DNA 

1 0 technology, as described in the laboratory manuals referenced and incorporated 

herein, for example as found in J. Sambrook, E.F. Fritsch and T. Maniatis, 
Molecular Cloning: A Laboratory Manual, 2d ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (1989). 



Table 1. Seed Media, SMI 


Ingredient 


Concentration (g/L) 


Sucrose 


50 


Potassium Phosphate, Monobasic 


0.5 


Potassium Phosphate, Dibasic 


1.5 


Urea 


3.0 


Magnesium Sulfate 


5.0 x lO' 1 


Polypeptone 


20 


Beef Extract 


5.0 


Biotin 


7.56 x lO" 4 


Thiamine 


3.0 x lO 3 


Niacinamide 


1.25 xlO" 1 


L-Methionine 


5.0 x. 10" 1 


L-Threonine 


2.5 x. 10" 1 


L-Alanine 


5.0 x 10 1 


pH 


7.3 
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Table 2. Main Media, FM3 


Ingredient 


Concentration (g/L) 


Dextrose* 


60 


Ammonium Sulfate 


50 


Potassium Phosphate, Monobasic 


1.0 


Magnesium Sulfate 


4.0 x 10" 1 


Manganese Sulfate 


1.0 x 10 2 


Ferrous Sulfate 


1.0 x lO" 2 


Biotin 


3.0 x lO" 4 


Calcium Carbonate 


50 


Corn Steep Liquor (dissolved solids) 


20 


pH (adjusted with KOH) 


7.4 



*Dextrose was added after autoclaving 



Example 4 

Preparation of L-Lysine Pathway Multi-Gene Constructs 

The invention farther comprises additional L-lysine multi-gene constructs 
constructed using the PCR technique. Standard PCR and subcloning procedures 
were utilized, as described above, to generate 5-gene constructs similar to those 
in Example 1 . The constructs of this example comprise the antibiotic resistance 
gene, chloramphenicol acyl transferase (CAT). The CAT gene was operably 
linked to a Corynebacteria phosphofructokinase promoter for expression in 
Coryn ebacteria . 

The following steps were performed in constructing the following 
constructs containing the CAT gene: 

1 . ^GEMY -ask-as d: -2.6 Kb PCR product containing the ask-asd 
operon of ATCC21529 using primers ask and asd was cloned into pGEM-T 
(Promega pGEM-T vector systems); 

2. pUC IS-ddh: -1.3Kb Kpnl fragment of pADM2 1 containing ddh 
(NRRL Bl 1474) was subcloned into pUC18 at the Kpnl site; 

3 . pLIC 1 .1-argS-lysA : ~3Kb PCR product using template BF 1 00 
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genomic DNA and primers argS and lysA was cloned into pPMG-LIC cloning 
vector (PharMingen); 

4. pM4-dapB-ORF2-dapA : ~3 Kb PCR product using primers dapB 
and dapA was cloned into pM4 at the blunted Xbal site; 
5 5. pFCi-ask-asd: -2.6 Kb Nsil-Apal fragment of pGEMT-ask-asd 

was cloned into pFC3 cut with PstI and Apal; 

6. pFCl-ddh: -1.3 Kb Sall-EcoRI fragment of pUC18-tMi was 
cloned into pFCl cut with Sail and EcoRI; 

7. pF 'CI -ddh-lys A: -1.5 Kb EcoRI fragment (containing the 
10 truncated lysA DNA) of plACl.l-argS-lysA was cloned into pFCl-ddh at the 

EcoRI site; 

8. pFC\ -ddh-lys A: -2.1 Kb EcoRI -Pstl fragment (containing the 
intact lysA DNA) of pRS6 was cloned into pFC\-ddh cut with EcoRI and Pstl; 

9. pFC5-dapB-ORF2-dapA: -3.4 Kb BamHI-Bglll fragment of 
1 5 pM4-dapB-ORF2-dapA was cloned into pFC5 at the BamHI site; 

10. pFC5-dapB-ORF2-dapA-ddh-'lysA: -2.8 Kb Nhel fragment of 
pFCl-ddh-lysA was cloned into pFC5-dapB-ORF2-dapA at the Nhel site; 

1 1 . pFC5-dapB-ORF2-dapA-ddh-lysA: -3.4 Kb Nhel fragment of 
pFCl-ddh-lysA was cloned into pFC5-dapB-ORF2-dapA at the Nhel site; 

20 12. pFCS-ask-asd-dapB-ORF2-dapA-ddh-'lysA (pFC 3 -KD AB H ' L) : 

-6.2 Kb NotI fragment of pFC5-dapB-ORF2-dapA-ddh-lysA was cloned into 
pFC3-ask-asd at the NotI site; 

13. pFC2-ask-asd-dapB-ORF2-dapA-ddh-lysA (pFC3-KDABHL): 
-6.8 Kb NotI fragment of pFC5- dapB-ORF2-dapA-ddh-lysA was cloned into 

25 pFCi-ask-asd at the NotI site, 

14. pK\84-ask-asd-dapB-ORF2-dapA-ddh-'lysA (pK184-KDABH'L): 
-8.8 Kb Pmel fragment of pFC3-ask-asd-dapB-ORF2-dapA -ddh- 'lysA was 
cloned into pK184 at the Hindi or Smal site, 
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15. pDElia2-ask-asd-dapB-ORF2-dapA-ddh-lysA (pD2-KDABHL): 
-9.4 Kb Pmel fragment of p¥C3-ask-asd-dapB-ORF2-dapA-ddh-lysA was 
cloned into pDElia2 at the Hindi site (contains the kan gene; is a full length 6 
gene construct); 

16. pDEliall-ask-asd-dapB-ORF2-dapA-ddh-'lysA (pDll- 
KDABH'L): -8.8 Kb Pmel fragment of p¥C3-ask-as d-dapB-ORF2 -dapA-ddh- 
'lysA was cloned into pDEliall at the Hindi or Smal site (contains the CAT 
gene; is a truncated 6 gene construct); 

17. pDElial \-ask-asd-dapB-ORF2-dapA-ddh-lysA (pDl 1-KDABHL): 
-9.4 Kb Pmel fragment of pEC3-ask-asd-dapB-ORF2-dapA-ddh-lysA was 

cloned into pDElial 1 at the Hindi site (contains the G4Tgene; is a full length 
6 gene construct); 

18. pDElia2: -1 .24Kb blunted PstI fragment of pUC4K ligated with 
the -1.75Kb Dral-Sspl fragment of pUC 19; 

19. pDElial 1: -1Kb PCR product containing the chloramphenicol 
acyl-transferase gene expressed by the C. glutamicum fda promoter was obtained 
using primers UCdral and UCsspI and pM4 as template and was ligated with the 
-1.75Kb Dral-Sspl fragment of pUC19; 

The primers utilized for the cloning procedures included: 
ask: 5'-GGGTACCTCGCGAAGTAGCACCTGTCAC-3' 
asd: 5'-GCGGATCCCCCATCGCCCCTCAAAGA-3' 

dapB: 5'-AACGGGCGGTGAAGGGCAACT-3' 
dapA : 5-TGAAAGACAGGGGTATCCAGA-3' 

ddh\ 5-CCAT GGTACC AAGTGCGTGGCGAG-3 ' 

ddhl 5-CCAT GGTACC ACACTGTTTCCTTGC-3' Kpn I sites: GGTACC 



argS: 5'-CTGGTTCCGGCGAGTGGAGCCGACCATTCCGCGAGG-3' 
lysA: 5'-CTCGCTCCGGCGAGGTCGGAGGCAACTTCTGCGACG-3 ' 
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a primer that anneals internally to lysA (about 500bp upstream to the end of 
lysA). 

UCdral 5-GGATCTTCACCTAGATCC 
UCsspI5'-CCCTGATAAATGCTTC 

"K", "D", "A", "B," "H," "L" and '"L" have the same designations as set 
forth above. 

Example 5 

Three-Fold Amplification of L-lysine Amino Acid Biosynthesis 
Pathway Genes 

For exemplary purposes only, Applicants provide herein an example 
wherein at least one L-lysine amino acid biosynthesis pathway gene is amplified 
by a factor of 3. 

Plasmid pDl 1-KDABH'L (constructed in Example 4) was used in the 
construction of high yield derivative cell lines of the invention. For cell 
transformation experiments with the isolated nucleic acid molecules of the 
invention, the growth preparation of competent cells, and determining of relative 
growth may be done according to the procedure set forth above. 

Plasmid pDl 1-KDABH'L DNA was introduced into NRRL-B30220 
(comprising pKl 84-KDABH'L), using the electroporation method above. 
Introduction of the pD 1 1 -KDABH'L plasmid DNA into NRRL-B3 0220 resulted 
in incorporation of one copy of pDl 1-KDABH'L into the host cell chromosome 
via a single crossover homologous recombination event. The host cell comprising 
two copies of five genes (pDl 1-KDABH'L and pKl 84-KDABH'L) has been 
deposited as NRRL-B30222. 

The amount of lysine produced by C. glutamicum ATCC 2 1 799 host cells 
having 3 copies of 5 genes (one endogenous copy and one copy of each of pDl 1 - 
KDABH'L and pKl 84-KDABH'L) is shown below. 
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L-lysine Production 

Strains L-Iysine titer (g/L) L-lysine yield (%) 

ATCC 21799 26.6 45.0 

NRRL-B30222 32.0 56.0 

5 Example 6 

This example describes changing the promoter to increase the level of 
expression of each of these 6 genes described above. Six genes encoding six 
different enyzmes of the biosynthetic pathway from L-aspartate to L-lysine have 
been inserted onto the chromosome of Corynebacterium glutamicum. The 

10 additional copy of each gene is from a C. glutamicum strain. The nucleotide 

sequences that regulate the level of expression (promoter) for each gene were the 
same as found on the C. glutamicum chromosome at the native loci. 

Increased expression can result in increased specific activities of the 
enzymes and improved flux of carbon from aspartate to lysine. The yield of 

1 5 lysine from glucose can be improved by this technique. 

The level of expression from a promoter sequence is referred to as 
strength. A strong promoter gives higher expression than a weak one. The 
mechanisms that determine the strength of a promoter have been described 
(Record, M.T., et al, "Escherichia coli RNA Polymerase, Promoters, and the 

20 Kinetics of the Steps of Transcription Initiation," in Escherichia coli and 

Salmonella: Cellular and Molecular Biology, ASM Press (1996), pp. 792-881). 
Sources of promoters include nucleotide sequences from the 5' end of genes 
native to the C. glutamicum chromosome, from sequences on plasmids that 
replicate in C. glutamicum, from sequences in the genome of phage that infect 

25 C. glutamicum, or from sequences assembled by humans (tac, trc) and are not 

found in nature. Genes of ribosomal proteins, ribosomal RNAs and elongation 
factors show high levels of expression. The promoters of these genes are 
candidates for increasing expression of amino acid biosynthetic pathway genes. 
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Anotherreason for changing promoters of genes in biosynthetic pathways 
is to make the pathway independent of factors that control the pathway in the 
wild type organism. For example the native promoter of the operon that contains 
diaminopimelate decarboxylase of the lysine biosynthetic pathway of C. 

5 glutamicum can respond to arginine or lysine in the growth medium. Arginine 

increased transcription three-fold and lysine decreased transcription by one third 
(Oguiza, etal, JBact. 775:7356-7362(1993)). Diaminopimelate decarboxylase 
activity decreased 60% in cells grown in minimal medium supplemented with 
lOmmM lysine (Cremer et ah, J Gen Microbiol 754:3221-3229 (1988)). 

10 Replacing the promoter of lysA which encodes the diaminopimelate 

decarboxylase is one way to make lysine biosynthesis independent of arginine 
and lysine levels in media. 



Example 6A 



Shown below are examples of promoters that are stronger than the askPl 
1 5 promoter which regulates the gene for aspartate kinase, the first enzyme in the 

pathway from aspartate to lysine. 



Beta-Galactosidase Assay of Candidate Promoters 



Candidate 


Specific Activity 
micro mol/'min/mg 


Origin 


E12 


0.20 


no promoter 


E12/pTAC 


49.80 


pKK223-3 


BF100 


0.08 


no promoter 


BF100/pAD151.1 


2.22 


aspartokinase PI 


E12 


0.11 


no promoter 


E12/pAD151.1 


1.96 


aspartokinase PI 


E12/5 


3.46 


BF100 genome 


E12/7 


.8.60 


BF100 genome 


E12/10 


6.56 


BF100 genome 


El 2/32 


3.11 


BF100 genome 


E12/3 


22.00 


corynephage 


E12/39 


11.57 


corynephage 
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| El 2/42 | 10.90 | corynephage | 

El 2 is a C. glutamicum strain that does not produce lysine. El 2 is a laboratory 
strain derived from ATCC 13059. BF 100 is a high level lysine producer (NRRL- 
Bl 1474). TAC is commercially available promoter that has been used as an 
example of a strong promoter. Four promoters from the C. glutamicum 
chromosome and three from a phage have been identified that are stronger than 
the native aspartokinase promoter. 

Example 6B 

Examples of strong promoters increasing specific enzyme activity of 
aspartokinase when expressed in C. glutamicum are shown below. 



Influence of IPTG on Aspartokinase activity 



Strain 


Regulator/promoter-gene 


Inducer 


nmol/min/mg 


BF100 


none 


none 


110 


?~D9trc-ask 


laclltrc-ask 


none 


103 


FD9trc-ask 


laclltrc-ask 


+IPTG (30 mg/L) 


269 


131-2 


laclltrc-ask 


none 


59 


131-2 


laclltrc-ask 


+IPTG(30mg/L) 


117 


131-5 


laclltrc-ask 


none 


59 


131-5 


laclltrc-ask 


+IPTG (30 mg/L) 


123 


pD9 is a plasmid that replicates in C. glutamicum. 


131 strains have the trc-ask construct integrated into the genome. 


IPTG induces genes controlled by the TRC promoter. 
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Example 6C 

Examples of the influence of lacl/trc-ask on lysine production in shake 
flasks are shown below. 



Strain 


Induction 


O.D. 


Titre 


Yield 


S.P. 


BF100 


none 


46 


26 


43 


58 


?D9trc-ask 


none 


49 


30 


49 


61 


?D9trc-ask 


+IPTG 


45 


30 


50 


68 


BF100 


none 


43 


23 


39 


53 


131-2 




34 


27 


46 


82 


131-5 


none 


35 


28 


47 


82 



O.D. = optical density at 660nm 



Titre = grams Lysine/liter 

Yield = grams lysine made/grams dextrose consumed 

S.P. = grams lysine/O. D . 

The production of lysine by BF100 was improved by increasing the 
strength of the aspartokinase promoter. 

Example 7 

This example demonstrates the use of vector pDElia2- ask-as d-dapA- 
ORF2-dapB-ddh-PllysA (pDElia2KDABHP 1 L) in the construction of the high 
yield cell lines of the invention. The Hpal-Pvall fragment containing the PI 
promoter was prepared as described in Marcel T. , et ah, Molecular Microbiology 
4:1819-1830 (1990). Applicants utilized standard PCR and subcloning 
procedures as set forth above. For cell transformation experiments with the 
isolated nucleic acid molecules of the invention, the growth preparation of 
competent cells, and determining or relative growth may be done according to 
the procedure set forth above. 
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Applicants performed the following steps in constructing the following 
vectors used in the L-lysine biosynthetic pathway. 

1. pGEMT -as k-asd: -2.6 Kb PCR product containing the ask-asd 
operon of ATCC21529 using primers ask and asd was cloned into pGEM-T 
(Promega pGEM-T vector systems). 

2. p\JC\S-ddh: -1.3 Kpnl fragment of pADM21 containing ddh 
(BF100 locus) was subcloned into pUC18 at the Kpnl site. 

3 . pFCi-ask-asd: -2. 6 Kb Nsil-Apal fragment of pGEMT-ask-asd 
was cloned into pFC3 cut with PstI and Apal. 

4. pFC3-dapB-ORF2-dapA: -2.9 Kb PCRproduct ofNRRL-B 1 1474 
dapB-ORF2-dapA coding region was cloned into pFC3 at the EcoRV site. 

5. pFC\-ddh: -1.3 Kb Pstl-EcoRI fragment of pUC\S-ddh was 
cloned into pFCl cut with PstI and EcoRI. 

6. pUC19-Pl: -550 bp Hpal-PvuII fragment (containing the first 
promoter, PI, of the argS-lysA operon) of pRS6 was cloned into pUC19 at the 
Smal site. 

7. pUC19-Pl/y^: ~1.45KbpromoterlessPCRproduct,usingprimer 
Ly^(ATG) and LysA3B, of NRRL-B1 1474 lysA coding region is cloned into 
pUC 1 9-P 1 at the Hindi site. 

8 . pFC 1 -P 1 lysA : -2 Kb EcoRI-Hincffll fragment of pUC 1 9-P 1 lysA 
was cloned into pFCl cut with EcoRI and Hindlll. 

9. pFCl-F UysA-ddh: -1.3 Kb EcoRI-NotI fragment of pFC\-ddh 
was cloned into pFCl-PllysA cut with EcoRI and Notl. 

10. pFC\-ask-asd-ddh-?UysA: ~2.6Kb Swal-Fsel fragment ofpFC3- 
as k-asd was cloned into pFC\-ddh-?\lysA cut with Swal and Fsel. 

11. pFC2>-ask-asd-dapB-ORF2-dapA-ddh-F\lysA (pFC3- 
KDABHP1L): -5.9 Kb Spel fragment of pFCl -as k-asd-ddh-f I lysA was cloned 
into pFCi-dapB-ORF2-dapA at the Spel site. 

12. -pDE\ia2-ask-asd-dapB-ORF2-dapA-ddh-VllysA (pDElia2- 
KDABHP1L): -8.8 Kb Pmel fragment of pFC3 -ask-as d-dapB-ORF2-dapA-ddh- 
FUysA was cloned into pDElia2 at the HincII site. 
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Primers used in PCR: 

lysA (ATG) : CCGGAGAAGATGTAACAATGGCTAC 
LysA3B : CCTCGACTGCAGACCCCTAGACACC 

The nucleotide sequence (SEQ ID NO: 17) of the Hpal-PvuII fragment 
containing the promoter PI is shown in figure 20. Results of lysine production 
in NRRL-B11474 comprising the pDE\isQ.-ask-asd-dapA-ORF2-dapB-ddh- 
PllysA (pDElia2 KDABHP1L) construct are shown below. 

Strain tested lysine lysine yield (%) cell deposit 
titer 

NRRL-B 11474 30 35 

NRRL-B 1 1 474 : :pDElia2-KD ABHP 1 L 37 42.8 NRRL B30359 

Example 8 

This example demonstrates the use of vector pDElia2 FC5 -ask-asd-dapB- 
ddh-lysA (pDElia2 FC5 KDBHL) in the construction of the high yield cell lines of 
the invention. The pDElia2 FC5 KDBHL vector comprises atruncated ORF2 gene 
and lacks a dapA gene. The ORF2 gene was cleaved at an internal Clal site, 
removing the 3' region and the dapA gene. A promoterless lysA gene was 
obtained from NRRL-B 11 474. For cell transformation experiments with the 
isolated nucleic acid molecules of the invention, the growth preparation of 
competent cells, and determining of relative growth may be done according to 
the procedure set forth above. Applicants performed the following steps in 
constructing the following vectors used in the L-lysine biosynthetic pathway. 

1. pGEMT -as k-asd: -2.6 Kb PCR product containing the ask-asd 
operon of ATCC21529 using primers ask and asd was cloned into pGEM-T 
(Promega pGEM-T vector systems). 

2. pPCS-ask-asd: -2.6 Kb Nsil-Apal fragment of pGEMT-ask-asd 
was cloned into pFC3 cut with PstI and Apal. 
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3. V FC3-dapB-ORF2-dapA: -2.9 Kb PCR product ofNRRL-Bl 1474 
dapB-ORF2-dapA coding region was cloned into pFC3 at the EcoRV site. 

4. v¥C3-dapB: the large Clal fragment of pFC3-dapB-ORF2-dapA 
was religated. 

5 5. p\JC\8-ddh: -1.3 Kb Kpnl fragment of pADM21 containing ddh 

(NRRL-B1 1474 locus) was subcloned into pUC18 at the Kpnl site. 

6. pFCl-ddh: -1.3 Kb Sall-EcoRI fragment of pUC18-cW/j was 
cloned into pFCl cut with Sail and EcoRI. 

7. pFCl-ddh-lysA: -2.1 Kb EcoRI-PstI fragment (containing the 
1 0 intact lysA DNA) of pRS6 was clone into pFC 1 -ddh cut with EcoRI and Pstl. 

8. pFCl-ask-asd-ddh-lysA: -2.6 Kb Swal-Fsel fragment of pFC3- 
ask-asd was cloned into pFCl-ddh-lysA cut with Swal and Fsel. 

9 . pFC3-ask-asd-dapB-ddh-lysA : -6 Kb Spel fragment of pFC 1 -ask- 
as d-ddh-lys A was cloned into pFC3 -dapB at the Spel site. 

15 10. pDElia2 TC5 -ask-asd-dapB-ddh-lysA (pDElia2 FC5 -KDBHL): -7.3 

Kb Notl-Pmel fragment of TpFC3-ask-asd-dapB-ddh-lysA was cloned into 

pDElia2 FC5 cut with NotI and Pmel. 

1 1 . pDElia2 FC5 : the small PvuII fragment of pFC5 was ligated with the 

large PvuII fragment of pDElia2. 
20 Results of lysine production in NRRL-B 11474 comprising the 

pDElia2 TC5 -ask-asd-dapB-ddh-lysA (pDElia2 FC5 KDBHL) are shown below. 



Strain tested 



lysine titer lysine yield (%) cell deposit 



NRRL-B 11474 

NRRL-B 1 1474: :pDElia2 FC5 -KDBHL 



31 
37.8 



49 
58 



NRRL B30360 



* * * * * 



Having now fully described the present invention in some detail by way 
of illustration and example for purposes of clarity of understanding, it will be 
obvious to one of ordinary skill in the art that same can be performed by 
modifying or changing the invention with a wide and equivalent range of 
conditions, formulations and other parameters thereof, and that such 
modifications or changes are intended to be encompassed within the scope of the 
appended claims. 

All publications, patents and patent applications mentioned in this 
specification are indicative of the level of skill of those skilled in the art to which 
this invention pertains, and are herein incorporated by reference to the same 
extent as if each individual publication, patent or patent application was 
specifically and individually indicated to be incorporated by reference. 



-67- 



What Is Claimed Is: 

1 . An isolated polypeptide comprising the amino acid s equence of SEQ ID 
NO:2. 

2. An isolated polynucleotide molecule comprising a nucleotide sequence 
encoding the polypeptide sequence of claim 1 . 

3. The isolated polynucleotide molecule of claim 2 comprising a nucleic 
acid having the sequence of SEQ ED NO: 1 . 

4. A vector comprising the isolated polynucleotide molecule of claim 2. 

5. A host cell comprising the vector of claim 4. 

6. A method comprising: 

(a) transforming a Corynebacterium species host cell with the 
polynucleotide molecule of claim 2, wherein said isolated polynucleotide 
molecule is integrated into said host cell's chromosome thereby increasing the 
total number of said amino acid biosynthetic pathway genes in said host cell 
chromosome, and 

(b) selecting a transformed host cell. 

7 . The method of claim 6 further comprising s creening for increased amino 
acid production. 

8. The method of claim 6 wherein said polynucleotide molecule further 
comprises at least one of the following: 

(a) a nucleic acid molecule encoding a Corynebacterium species 
lysine pathway asd amino acid sequence; 
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(b) a nucleic acid molecule encoding a Corynebacterium species 
lysine pathway dapA amino acid sequence; 

(c) a nucleic acid molecule encoding a Corynebacterium species 
lysine pathway dapB amino acid sequence; 

5 (d) a nucleic acid molecule encoding a Corynebacterium species 

lysine pathway ddh amino acid sequence; and 

(e) a nucleic acid molecule encoding a Corynebacterium species 
lysine pathway 'lysA amino acid sequence; 

(f) a nucleic acid molecule encoding a Corynebacterium species 
10 lysine pathway lysA amino acid sequence; and 

(g) a nucleic acid molecule encoding a Corynebacterium species 
lysine pathway ORF2 amino acid sequence. 

9. The method of claim 8 further comprising screening for increased amino 
acid production. 

15 10. The method of claim 6, wherein said isolated polynucleotide molecule 

further comprises at least one of the following: 

(a) a nucleic acid molecule encoding the asd amino acid sequence of 
SEQ ID NO:4; 

(b) a nucleic acid molecule encoding the dapA amino acid sequence 
20 of SEQ ID NO: 6; 

(c) a nucleic acid molecule encoding the dapB amino acid sequence 
of SEQ ID NO: 8; 

(d) a nucleic acid molecule encoding the ddh amino acid sequence 
of SEQ ID NO: 10; 

25 (e) a nucleic acid molecule encoding the 'lysA amino acid sequence 

of SEQ ID NO:21; 

(f) a nucleic acid molecule encoding the lysA amino acid sequence 
of SEQ ID NO: 14; 
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(g) a nucleic acid molecule encoding the ORF2 amino acid sequence 
ofSEQ ID NO: 16. 

1 1 . The method of claim 6, wherein said isolated polynucleotide molecule 
further comprises the following: 

(a) a nucleic acid molecule encoding the asd amino acid sequence of 
SEQ IDNO:4; 

(b) a nucleic acid molecule encoding the dapA amino acid sequence 
of SEQ ID NO:6; 

(c) a nucleic acid molecule encoding the dapB amino acid sequence 
of SEQ ID NO: 8; and 

(d) a nucleic acid molecule encoding the ORF2 amino acid sequence 
of SEQ ID NO: 16. 

12. The method of claim 6, wherein said isolated polynucleotide molecule 
further comprises the following: 

(a) a nucleic acid molecule encoding the as d amino acid sequence of 
SEQ ID NO:4; 

(b) a nucleic acid molecule encoding the dapA amino acid sequence 
of SEQ IDNO:6; 

(c) a nucleic acid molecule encoding the dapB amino acid sequence 
of SEQ ID NO: 8; 

(d) a nucleic acid molecule encoding the ddh amino acid sequence 
of SEQ ID NO: 10; and 

(e) a nucleic acid molecule encoding the ORF2 amino acid sequence 
of SEQ ID NO: 16. 

13. The method of claim 6, wherein said isolated polynucleotide molecule 
further comprises the following: 

(a) a nucleic acid molecule encoding the asd amino acid sequence of 
SEQ ID NO:4; 
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(b) a nucleic acid molecule encoding the dapA amino acid sequence 
ofSEQ ID NO: 6; 

(c) a nucleic acid molecule encoding the dapB amino acid sequence 
ofSEQ ID NO: 8; 

(d) a nucleic acid molecule encoding the ddh amino acid sequence 
ofSEQ ID NO: 10; 

(e) a nucleic acid molecule encoding the 'fysA amino acid sequence 
of SEQ ID NO:21; and 

(f) a nucleic acid molecule encoding the ORF2 amino acid sequence 
of SEQ ID NO: 16. 

14. The method of claim 6, wherein said isolated polynucleotide molecule 
further comprises the following: 

(a) a nucleic acid molecule encoding the asd amino acid sequence of 
SEQ IDNO:4; 

(b) a nucleic acid molecule encoding the dapA amino acid sequence 
of SEQ ID NO: 6; 

(c) a nucleic acid molecule encoding the dapB amino acid sequence 
of SEQ ID NO: 8; 

(d) a nucleic acid molecule encoding the ddh amino acid sequence 
of SEQ ID NO: 10; 

(e) a nucleic acid molecule encoding the lysA amino acid sequence 
of SEQ ID NO: 14; and 

(f) a nucleic acid molecule encoding the ORF2 amino acid sequence 
of SEQ ID NO: 16. 

15. The method of claim 6 further comprising: 

(a) growing said transformed host cell in a medium; and 

(b) purifying an amino acid produced by said transformed host cell. 
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1 6. An isolated polynucleotide molecule comprising: 

(a) the polynucleotide molecule of claim 2; and 

(b) at least one additional Corynebacterium species lysine pathway 
gene selected from the group consisting of: 

(i) a nucleic acid molecule encoding an asd polypeptide; 

(ii) a nucleic acid molecule encoding a dapA polypeptide; 

(iii) a nucleic acid molecule encoding a dapB polypeptide, 

(iv) a nucleic acid molecule encoding a ddh polypeptide; 

(v) a nucleic acid molecule encoding a 'lysA polypeptide; 

(vi) a nucleic acid molecule encoding a lysA polypeptide; and 

(vii) a nucleic acid molecule encoding an ORF2 polypeptide. 

17. The isolated nucleic acid molecule of claim 16, wherein: 

(a) said asd polypeptide is SEQ ID NO:4; 

(b) said dapA polypeptide is SEQ ID NO: 6; 

(c) said dapB polypeptide is SEQ ID NO:8; 

(d) said ddh polypeptide is SEQ ID NO: 1 0; 

(e) said 'lysA polypeptide is SEQ ID NO: 2 1 ; 

(f) said lysA polypeptide is SEQ ID NO: 14; and 

(g) said ORF2 polypeptide is SEQ ID NO: 16. 

18. An isolated polynucleotide molecule comprising: 

(a) the polynucleotide molecule of claim 2; 

(b) a nucleic acid molecule encoding the asd amino acid sequence of 
SEQ ID NO:4; 

(c) a nucleic acid molecule encoding the dap A amino acid sequence 
of SEQ ID NO:6; 

(d) a nucleic acid molecule encoding the dapB amino acid sequence 
of SEQ ID NO: 8; and 

(e) a nucleic acid molecule encoding the ORF2 amino acid sequence 
of SEQ ID NO: 16. 
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19. An isolated polynucleotide molecule comprising: 

(a) the polynucleotide molecule of claim 2; 

(b) a nucleic acid molecule encoding the as d amino acid sequence of 
SEQ ID NO:4; 

(c) a nucleic acid molecule encoding the dap A amino acid sequence 
of SEQ ID NO:6; 

(d) a nucleic acid molecule encoding the dapB amino acid sequence 
of SEQ ID NO: 8; 

(e) a nucleic acid molecule encoding the ddh amino acid sequence 
of SEQ ID NO: 10; and 

(f) a nucleic acid molecule encoding the ORF2 amino acid sequence 
of SEQ ID NO: 16. 

20. An isolated polynucleotide molecule comprising: 

(a) the polynucleotide molecule of claim 2; 

(b) a nucleic acid molecule encoding the asd amino acid sequence of 
SEQ ID NO:4; 

(c) a nucleic acid molecule encoding the dapA amino acid sequence 
of SEQ ID NO: 6; 

(d) a nucleic acid molecule encoding the dapB amino acid sequence 
of SEQ ID NO: 8; 

(e) a nucleic acid molecule encoding the ddh amino acid sequence of 
SEQ ID NO: 10; 

(f) a nucleic acid molecule encoding the 'lysA amino acid sequence 
of SEQ IDNO:21; and 

(g) a nucleic acid molecule encoding the ORF2 amino acid sequence 
of SEQ ID NO: 16. 
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21. An isolated polynucleotide molecule comprising: 

(a) the polynucleotide molecule of claim 2; 

(b) a nucleic acid molecule encoding the as d amino acid sequence of 
SEQ ID NO:4; 

(c) a nucleic acid molecule encoding the dapA amino acid sequence 
of SEQ ID NO: 6; 

(d) a nucleic acid molecule encoding the dapB amino acid s equence 
of SEQ ID NO: 8; 

(e) a nucleic acid molecule encoding the ddh amino acid sequence of 
SEQ ID NO: 10; 

(f) a nucleic acid molecule encoding the lysA amino acid sequence 
of SEQ ID NO: 14, and 

(g) a nucleic acid molecule encoding the ORF2 amino acid sequence 
of SEQ ID NO: 16. 

22. The isolated polynucleotide molecule of claim 18 comprising pK184- 
KDAB. 

23. The isolated polynucleotide molecule of claim 20 comprising pDll- 
KDABH'L. 

24. The isolated polynucleotide molecule of claim 21 comprising pD2- 
KDABHL. 

25. A vector comprising the polynucleotide molecule of claim 16. 

26. A host cell comprising the vector of claim 25. 

27. The host cell of claim 26 wherein said host cell is a Brevibacteriwn 
selected from the group consisting of Brevibacteriwn flavurn NRRL-B30218, 
Brevibacteriwn flavum NRRL-B30219, Brevibacteriwn lactofermentwn 
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NRRL-B30220, Brevibacterium lactofermentum NRRL-B3022 1 , Brevibacterium 
lactofermentum NRRL-B30222, Brevibacterium flavum NRRL-30234 and 
Brevibacterium lactofermentum NRRL-30235. 

28 . The host cell of claim 26 wherein said host cell is Escherichia coli DH5 
5 a MCR NRRL-B30228. 



29. The host cell of claim 26 wherein said host cell is a C. glutamicum 
selected from the group consisting of C. glutamicum NRRL-B30236 and C. 
glutamicum NRRL-B30237. 

30. A method of producing lysine comprising culturing the host cells of 
10 claim 5 wherein said host cells comprise one or more of: 

(a) increased enzyme activity of one or more lysine biosynthetic 
pathway enzymes compared to the genetically unaltered host cell; 

(b) one or more copies of each gene encoding a lysine biosynthetic 
pathway enzyme; and, 

15 (c) alteration of one or more transcription factors regulating 

transcription of one or more genes encoding a lysine biosynthetic pathway 
enzyme, wherein said host cell produces lysine in said culture medium. 



31. The method of claim 30 wherein said increased enzyme activity 
comprises overexpressing one or more genes encoding one or more lysine 

20 biosynthetic pathway enzymes. 

32. The method of claim 31 wherein said one or more genes are operably 
linked directly or indirectly to one or more promoter sequences. 

33 . The method of claim32 wherein said operably linkedpromoter sequences 
are heterologous, endogenous, or hybrid. 
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34. The method of claim33 wherein said promoter sequences are one or more 
of: a promoter sequence from the 5' end of genes endogenous to C. glutamicwn, 
a promoter sequence from plasmids that replicate in C glutamicwn, and, a 
promoter sequence from the genome of phage which infect C. glutamicum. 

5 35. The method of claim 33 or 34 wherein one or more of said promoter 

sequences are modified. 



36. The method of claim 3 5 wherein said modification comprises truncation 
at the 5' end, truncation at the 3' end, non-terminal insertion of one or more 
nucleotides, non-terminal deletion of one or more nucleotides, addition of one or 
10 more nucleotides at the 5' end, addition of one or more nucleotides at the 3' end, 

and, combinations thereof. 



37. The method of claim 30 wherein said increased enzyme activity results 
from the activity of one or more modified lysine biosynthetic pathway enzymes 
wherein said enzyme modification results in a change in kinetic parameters, 

1 5 allosteric regulation, or both, compared to the enzyme lacking said modification. 

38. The method of claim 37 wherein said change in kinetic parameters is a 
change in K™, V max or both. 

39. The method of claim 37 wherein said change in allosteric regulation is a 
change in one or more enzyme allosteric regulatory sites. 

20 40. The method of claim 37 wherein said modification is a result of a change 

in the nucleotide sequence encoding said enzyme. 

4 1 . The method of claim 40 wherein said change in said nucleotide sequence 
is an addition, insertion, deletion, substitution, or a combination thereof, of one 
or more nucleotides. 
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42. The method of claim 30 wherein said alteration of one or more 
transcription factors comprises one or more mutations in transcription inhibitor 
proteins, one or more mutations in transcription activator proteins, or both, 
wherein said one or more mutations increases transcription of the target 
nucleotide sequence compared to the transcription by said one or more 
transcription factors lacking said alteration. 

43 . The method of claim 42 wherein said one or more mutations is a change 
in said nucleotide sequence encoding said transcription factor. 

44. The method of claim 43 wherein said change in said nucleotide sequence 
is an addition, insertion, deletion, substitution, or a combination thereof, of one 
or more nucleotide. 

45. An isolated polypeptide, wherein said polypeptide comprises an amino 
acid sequence having at least 95% sequence identity to the amino acid sequence 
ofSEQ ID NO: 19. 

46. The polypeptide of claim 45, wherein said polypeptide has the amino acid 
sequence of SEQ ID NO: 1 9. 

47. An isolated polynucleotide molecule comprising a nucleotide sequence 
encoding the polypeptide of claim 45. 

48. The isolated polynucleotide molecule of claim 47 comprising a nucleic 
acid having the sequence of SEQ ID NO: 18. 

49. A vector comprising the polynucleotide molecule of claim 47. 

50. A host cell comprising the vector of claim 49. 

51 . The host cell of claim 50 wherein said host cell is NRRL B30360. 
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52. A method comprising: 

(a) transforming a Corynebacterium species host cell with the 
polynucleotide molecule of claim 47, and 

(b) selecting a transformed host cell. 

53. An isolated polypeptide wherein said polypeptide comprises a 
polypeptide having at least 95% sequence identity to the amino acid sequence of 
SEQ ID NO:21. 

54. The polypeptide of claim 53 having the amino acid sequence of SEQ ID 
NO:21. 

55. An isolated polynucleotide molecule comprising a nucleotide sequence 
encoding the polypeptide of claim 53. 

56. The isolated polynucleotide molecule of claim 55 comprising a nucleic 
acid having the sequence of SEQ ID NO: 20. 

57. A vector comprising the polynucleotide molecule of claim 55. 

58. A host cell comprising the vector of claim 57. 

59. The host cell of claim 58 wherein said host cell is selected from the group 
consisting of NRRL B30218, NRRL B30220 and NRRL B30222. 

60. A method comprising: 

(a) transforming a Corynebacterium species host cell with the 
polynucleotide molecule of claim 55, and 

(b) selecting a transformed host cell. 
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61. The isolated polynucleotide molecule of claim 2 further comprising a 
promoter sequence where said promoter sequence has at least 95% sequence 
identity to SEQ ID NO: 17. 

62. The polynucleotide of claim 61 where said promoter sequence has the 
nucleotide sequence of SEQ ID NO: 17. 

63 . The isolated polynucleotide molecule of claim 6 1 wherein said promoter 
is operably directly linked to the LysA gene. 

64. A vector comprising the isolated polynucleotide of claim 6 1 . 

65. A host cell comprising the vector of claim 64. 

66. The host cell of claim 65 wherein said host cell is NRRL B30359. 

67. A method comprising: 

(a) transforming a Corynebacterium species host cell with the 
polynucleotide molecule of claim 61, and 

(b) selecting a transformed host cell. 
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Increased Lysine Production by Gene Amplification 
Abstract 

The invention provides methods to increase the production of an amino 
acid from Corynebacterium species by way of the amplification of amino acid 
biosynthetic pathway genes in a host cell chromosome. In a preferred 
embodiment, the invention provides methods to increase the production of L- 
lysine in Corynebacterium glutamicum by way of the amplification of L-lysine 
biosynthetic pathway genes in a host cell chromosome. The invention also 
provides novel processes for the production of an amino acid by way of the 
amplification of amino acid biosynthetic pathway genes in a host cell 
chromosome and/or by increasing promoter strength. In a preferred embodiment, 
the invention provides processes to increase the production of L-lysine in 
Corynebacterium glutamicum by way of the amplification of L-lysine 
biosynthetic pathway genes in a host cell chromosome. The invention also 
provides novel isolated nucleic acidmolecules for L-lysine biosynthetic pathway 
genes of Corynebacterium glutamicum. 
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Nucleotide 



Sequence oC ATCC21529 ask (SEQ ID NO:l> 



1 GTCGCCCTCC TCGTACAGAA atatgocggt tcctcgcttg agagtgcgga 
'.)] AC.GCATTAGA AACGTCGCTG AACGGATCGT TGCCACCAAG AAGGCTGGAA 
101 ATGATGTCGT GGTTGTCTGC TCCGCAATGG GAGACACCAC GGATGAACTT 
i^l CTAGAACTTG CAGCGGCAGT GAATCCCGTT CCGCCAGCTC GTGAAATGGA 
201 TATGCTCCTG ACTGCTGGTG AGCGTATTTC TAACGCTCTC GTCGCCATGG 
251 CTATTGAGTC CCTTGGCGCA GAAGCTCAAT CTTTCACTGG CTCTCAGGCT ' 
301 GGTGTGCTCA CCACCGAGCG CCACGGAAAC GCACGCATTG TTGACGTCAC 
351 ACCGGGTCGT GTGCGTGAAG CACTCGATGA GGGCAAGATC TGCATTGTTG 
401 CTGGTTTTCA GGGTGTTAAT AAAGAAACCC GCGATGTCAC CACGTTGGGT 
451 CGTGGTGGTT CTGACACCAC TGCAGTTGCG TTGGCAGCTG CTTTGAACGC 
501 TGATGTGTGT GAGATTTACT CGGACGTTGA CGGTGTGTAT ACCGCTGACC 
551 CGCGCATCGT TCCTAATGCA CAGAAGCTGG AAAAGCTCAG CTTCGAAGAA 
601 ATGCTGGAAC TTGCTGCTGT TGGCTCCAAG ATTTTGGTGC TGCGGAGTGT 
651 TGAATACGCT CGTGCATTCA ATGTGCCACT TCGCGTACGC TCGTCTTATA 
"701 GTAATGATCC CGGCACTTTG ATTGCCGGCT CTATGGAGGA TATTCCTGTG 
751 GAAGAAGCAG TCCTTACCGG TGTCGCAACC GACAAGTCCG AAGCCAAAGT 
801 AACCGTTCTG GGTATTTCCG ATAAGCCAGG CGAGGCTGCC AAGGTTTTCC 
B51 GTGCGTTGGC TGATGCAGAA ATCAACATTG ACATGGTTCT GCAGAACGTC 
901 TCCTCTGTGG AAGACGGCAC CACCGACATC ACGTTCACCT GCCCTCGCGC 
951 TGACGGACGC CGTGCGATGG AGATCTTGAA GAAGCTTCAG GTTCAGGGCA 
1001 ACTGGACCAA TGTGCTTTAC GACGACCAGG TCGGCAAAGT CTCCCTCGTG 
1051 GGTGCTGGCA TGAAGTCTCA CCCAGGTGTT ACCGCAGAGT TCftTGGAAGC 
1101 TCTGCGCGAT GTCAACGTGA ACATCGAATT GATTTCCATC TCTGAGATCC 
1151 GCATTTCCGT GCTGATCCGT GAAGATGATC TGGATGCTGC TGCACGTGCA 
1201 TTGCATGAGC AGTTCCAGCT GGGCGGCGAA GACGAAGCCG TCGTTTATGC 
12 51 AGGCACCGGA CGCTAA 
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GTGCCCCTGCTCGTACAGAAATATGGCGGTTCCTCGCTTGACAGTGCGGAACC-rATTACA 

A I- V V O K Y C C S S I. E S A. £ P : B 

A.-.CGTCGCTGAACGGATCGTTGCCACCAAGAAGGCTCGAAATGATGTCGTGG'rrGTCTGC 
01 .. — -* H * » 1J0 

pVAEBIVftTkKACNDvv«\?C 

TCCGCPATGQGAGRCACCACGChTGflACTTCTAGAACTTGCAGCGGCAGTGAA^CCCGTT 
121 * * ♦ , 1 + lg0 

S AMGDTT DEL LELAAAVN p V 

CCGCcaGCTCGTGAMTGGATATGCTCCTGACTGCTGGTGAGCGTATTTCTAACGCTCTC 

181 . 4- + +- + + - + 240 

PPAREMDMLLTAGER1SNAL 

CTGAGTCCCTTGGCGCAGAAGCrrCAATCTTTCACTGGC 

VAMA1ESLGAEAQSFTG 



GV^TTERHGNARIVpVTPGR 

GTGCGtGAAGCACTCGATGAGGGCAAGAICTGCATXGTTGCTGGTTTTCAGGGTGTTAAT 
I + , . .+ +— + + — • — + A 

VREAIjDEGKICIVAGFQGVN 

AAAGAAACCCGCGATGTCACCACGTTGGGTCGTGGTGGTTCTGACACCRCTGCAGTTGCG 
I .. +. — + + + + + 4 

KETRDVTTLGRGGSDTTAVA 

TTGGCAGCTGCXTTGAACGCTGATGTGTGTGAGATTTACTCGGACGTTGACGGTGTG'TAT 
X + + + + +■ + S 

LAAALNADVCEIVSOVOGVY 1 

ACCGCTGACCCGCGCATCGTTCCTAATGCACAGMGCTGGAAAAGCTCAGCTTCC-AAGAA 
H , . ^ + -__- + f .+ + 

TADPRIVPNAQKLEKLSFSE 



L E L A A 



rafnvplrvrssvsndpgti- 
attgccggctctatggaggatattcctgtggaagaagcagtccttaccggtgtcgcaacc 

721 -_+-.,. .---+ ,. , + • 

I A G SMEDIPVEEAVLTGVAT 



K S E A 



i< v - r a l a d a ~. i i: ; ^ ••: v i t: •.- 

TCCTCTGTGGAAGACGGCACCACCGACA.TCACGTT "P.CCTGCCCTCG 2G" -ACGG^GC 

901 — . . -+ __- + + ^ 4 ~ 

H 960 

SSVfJDGTTDITFTCPRADGR 

CGTGCGATGGAGJ\TCTTGMGAAGCTTCAGGTTCAGGGCAACTGGACCAf^GTGCTTTAC 
361 + + „ + + + i02Q 

RAHEILKKLQVQGNWTNVLY 

GACGACCAGGTCGGCAAAGTCTCCCTCGTGGGTGCTGGCATGAAGTCTCACCCAGGTGTT 
102L + , ,+__., + + + ioeo 

DDQVGKVSLVGAGMKSHPGV 
ACCGCAGAGTTCATGGAAGCTCTGCGCGATGTCAACGTGAACATCGA^TTG-TTTCCATC 

xoei + + + + + lliJ0 

T ; A Z F M E A L R L V N V K I E h Z S I 

TCTGAGATCCGCATTTCCGTGCTGATCCGTGAAGATGATCTGGATGC^GCT^CACGTGCA 
.114! + „ + + __ + + ^ + i2oq 

SEIRISVLIREDDLDAA.-. K A 
TTGCATGAGCAGTTCCAGCTGGGCGGCGAAGACGAAGCCGTCGTTTA'tGCAGGCACCGGA 

1201 + + + i 126Q 

L H EQ FQLGGED Z .-. V . V Y h 1 7 g 

CGCTAA 
1261 1266 



3 

Nucleotide. sequence of ATCC21529 as3 (SEQ ID NO:,g) 

1 ATGACCACCA TCGCAGTTGT TGGTGCAACC GGCCAGGTCG GCCAGGTTAT 

SI GCGCACCTTT TTGGAAGAGC GCAATTTCCC AGCTGACACT GTTCGTTTCT 

101 TTGCTTCCCC GCGTTCCGCA GGCCGTAAGA TTGAATTCCG TGGCACGGAA 

151 ATCGAGGTAG AAGACATTAC TCP.GGCAACC GAGGAGTCCC TCAAGGGCAT 

201 CGACGTTGCG TTGTTCTCTG CTGGAGGCAC CGCTTCCAAG CAGTACGCTC 

251 CACTGTTTGC TGCTGCAGGC GCGACTGTTG TGGATAACTC TTCTGCTTGG 

301 CGCAAGGACG acgaggttcc actaatcgtc tctgaggtga acccttccga 

3 SI CAAGGATTCC CTGGTCAAGG GCATTATTGC GAATCCTAAC TGCACCACCA 

4 01 TGGCTGCAAT GCCAGTGCTG AAGCCACTGC ACGATGCCGC TGGTCTTGTA 

4 51 AAGCTTCACG TTTCCTCTTA CCAGGCTGTT TCCGGTTCTG GTCTTGCAGG 

501 TGTGGAAACC TTGGCAAAGC AGGTTGCTGC AGTTGGCGAC CACAACGTTG 

551 AGTTCGTCCA TGATGGACAG GCTGCTGACG CAGGCGATGT CGGACCTTAC 

601 GTTTCCCCAA TCGCTTACAA CGTG"TGCCA 7TCGCCGGAA ACCTCGTCGA" 

651 TGACGGCACC TTCGAAACCG ACG.=AGAGCA GAAGCTGCGC AACGAATCCC 

701 GCAAGATTCT CGGCCTCCCA GACC7CAAGG 7CTCAGGCAC CTGCGTCCGC 

751 GTGCCGGTTT TCACCGGCCA CACGCTGACC ATTCACGCCG AATTCGACAA 

801 GGCAATCACC GTCGAGCAGG CGCAGGAGAT CTTGGGTGCC GCTTCAGGCG 

8S1 TCGAGCTTGT CGACGTCCCA ACCCCACTTG CAGCTGCCGG CATTGACGAA 

901 TCCCTCGTTG GACGCATCCG TCAGSACTCC ACTGTCGACG ACAACCGCGG 

951 TCTGGTTCTC GTCGTATCTG GCGATAACCT 7CGCAAGGGC GCAGCACTGA 

1001 ACACCATTCA GATTGCTGAG CTGCTGGTTA AGTAA 



ATGACCACCATCGCAGT7GTTGG7GCAACCGCX]CAGGTCGGGCAGCTTATGCGCACCTTT 

1 — — * — '— - » 1 UO 

HTTIAVVCATCQVGQVMRTF 

TTGGW\GACCGCAJ\TTT«CAGCTG\C^CrTGrrcGTTTCTTrGCTTCCCCCCGTTCCCCA 
51 .. * + . 1 f l30 

LEeRNFPADTVRFFASPRSA 

GGCCGTMGArrGAATTCCGTGGCACCGAMTCCAGCTAGAACACAtTACTCAGGCAACC 



GAGGACTCCCT'CAAGGGCATCGACGTTGCGTTCTTCTCTGCTGOAGGCACCGCTrCCAAC 



eeslkctdvalfsaggt-sk 
Cagtacgctccactgtttgctgctgcaggi^ocgactgttgtggataactcttctgcttgg 



CCCW^GGACCACGAGGTtCCACTAAtCGTCTCTCAGGTORACCCTTCCGACAAGGATTCC 

RKODEVPLlVStVHPSOSDS 
CTGOTCAACGGCATTATTGCCAATCCTAtCTGCACCACCATCGCTGCAATGCCAGTCCTC 



AAGCCACTGCACGATGCCCCTGGTCTTCTf 

KPLHOAAGI, VKLKVSSTQAV 

TCCCGrrCTGGTCTTGCAGGTGTGCAA>.-t7TGGCA*AGCAGGT'rGC'tGOlGTTCGCGAC 
;1 »- — * --+-.. , 5^0 

SGSGLAGV ETwAKQYAAV GO 

CACAACGTTGACTTCGTCCATGATGGAC.'.GJGTGCTGACGCAGGCGATG'VCGGACCTTAC 

HKVEFVKOCOAAOAODVGey 
GTTTCCCCAATCGCTTACAACGTGCTGCCATTCGCCGGAAACCTCGTCCATGACGGCACC 

TTCGAAACCGACGAAGAGCAGAAGCTGCCC^jCGAATCCCCCAAGATrCTCGGCCTCCCA 
; 6i , 1 * + ; z 

FETDEEQKLRKESRKll/GLP 

GACCTCAAGCTCTCAGGCACCTCCGTCCGCGTGCCGGTTTTCACCGGCCACACCCTGACC 
721 + 1 * * --— * < It 

OUKVSGTCVRVPVfTCHt LT 

ATTCACGCCGAATTCGACAACGCAATCACCGTCGACCAGCCGCAGGAGArcrrGGGTGCC 
131 1 , ^ w 1 Q 

IKAGfOKAlTVEOAQEItCA 

GCTTqAGGCGTCGAGCTTGTCCACCTCCCMCCCCACTTCCACCTCCCCGCATTGACGAA 



TCCC'i'CGTTGGACGCATCCGTCAGGACTCCACTGTCGACGACAACCGCCGTCTC-GTTCTC 
q 0 l « + < 4 * 4 960 

5 L V C, R I R Q D S T V D D N 1-. G L V L 

GTCGTATCTGGCGATAACCTTCGCAAGGGCCCAGCACTGAACACCATTCAGATTGCTGAG 
361 + + + + + + 102Q 

VVSGDNLRKGAALNTIQIAE 

CTGCTGGTTAAGTAA 
1021 + 1035 

L L V K * 



Nucleotide. 



Sequence of 



cLapA (SSQ ID NO-.'jr> 



1 ATGAGCACAG GTTTAACAGC TAAGACCGGA GTAGAGCACT TCGGCACCGT 

51 TGGAGTAGCA ATGGTTACTC CATTCACGGA ATCCGGftGAC AtCGATATCG 

10 1 CTGCTGGCCG CGAAGTCGCG GCTTATTTGC TTGATAAGGG CTTGGATTCT 

151 TTGGTTCtCG CGGGCACCAC TGGTGAATCG CCAACGACAA CCGCCGCTGA 

201 AAAACTAGAA CTGCTCAAGG CCGTTCGTGA GGAAGTTGGG GATCGGGCGA 

251 AGCTCATCGC CGGTGTCGGA ACCAACAACA CGCGGACATC TGTGGAACTT 

301 GCGGAAGCTG CTGCTTCTGC TGGCGCAGAC GGCCTTTTAG TTGTAACTCC 

351 TTATTACTCC AAGCCGAGCC AAGAGGGATT GCXGGCGCAC TTCGGTGCAA 

401 TTGCTGCAGC AACAGAGGTT CCAATTTGTC TCTATGACAT TCCTGGTCGG 

451 TCAGGTATTC CAATTGAATC TGATACCATG AGACGCCTGA GTGAATTACC 

501 TACGATTTTG GCGGTCAAGG ACGCCAAGGG TGACCTCGTT GCAGCCACGT 

551 CATTGATCAA AGAAACGGGA CTTGCCtGGT ATTCAGGCGA TGACCCACTA 

601 AACCTTGTTT GGCTTGCTTT GGGCGGATCA GGTTTCATTT CCGTAATTGG 

651 ACATGCAGCC CCCACAGCAT TACGTGAGTT GTACACAAGC TTCGAGGAAG 

7 01 GCGACCTCGT CCGTGCGCGG GAAATCAACG CCAAACTATC ACCGCTGGTA 

"7 31. GCTGCCCAAG GTCGCTTGGG TGGAGTCAGC TTGGCAAAAG CTGCTcTGCG 

801 TCTGCAGGGC ATCAACGTAG GAGATCCTCG ACTTCCAATT ATGGCTCCAA 

9 51 ATGAGCAGGA ACTTGAGGCT CTCCGAGAAG ACATGAAAAA AGCTGGAGTT 

901 CTATAA 




, , AW,CCACftCfiTTT/->ACACCTftAf^-CGC,-V2TnCf^^A<;TTt .,.t:f..r;CTTGa/-.qTAc;CA 

^elc^t^ ^o-** \ — *- — * 



(■.(rrTATTTGC.TTGftTAACCJiCl 



CXAACGAC^CCGCCCCTCAAA/vftCTAGRACTGCrrC.^GCCC^TTCCTGAGGAAGTTCGC 



DRAKI-rAGVCTNKTRTSVEI. 
Caj<^5CTGCTCCTTCTG«CCCGCAGftCCGCCTTTTAGTtGTAACTCCTTATTftCTCC 



AEAA'ASAGADG LLVVT fit s 
AAGCCGAGCCAAGAGGGATTGCTGGCGCACTTCGGTGCAA^TGCTCCAGCAACAGAGGTT 



KPSQEGLtAH F G A I A A A T C V 
«AATTTCTCTCTATGACATTCCTCGTCGG7CAGGT.-.rTCCAATTGAl,?CTG/w^.CCATG 

i -- * ♦— -'<- * * <e< 

piclkoipgrsg: p : e s d ? k 
agacgcctgagtgaattacct^.ccattttggcotcia3gacc-zcaagcgrgficctcgtt 
rnl$elpti oavk&a. kgdlv 

GCAGCCACGTCATTQhTCAAAGWlAgGGGACTtGCCt3iTATTCAGGCCATGACCCACTA 
,1 + — „--<.-„.. x 6C 

AATSLIKSTQLAKYSCDDPL 

AACCTTCTTTGGCTTCCTTTGCSCGG.-.TCACCTTTCAtTTCCGTAATT-oGACATGCACCC 
01 « -+ - t , , G 

NL'VHt-AI-GGSGf 1 _ S V \ G H A A 

CCCACAGCATTACCtGAGTTGTACACAjiGC-TT-SACO-AGGC^.-.CC'C^TCCCTGCGCGO 



GAAATCAACGCCAAACTATCACCGCTCGTACCTGCCCAAGGTCGt 

E1NAKI, S PLVAAQGSLGCV £ 
TTGGCAAAAGCTGCf ctGCGTCTGCAGGCCATCAAGGrAGGAGATCCTCGACTTCCAArT 

IAKAALRLQG I » V C 0 P R L P I 
ATGGCTCCAAATGAGCAGGAACtTGAGGCTCTCCG?,GA.'\GAC-.TGA 1 AAAAGCTGGAGTT 



Nucleotide 



sequence of 



dapB ISfj ID NO: ^ 



L ATGGGAATCA AGGTTGGCGT TCTCCGAGCC AA.-.GGCCGTG TTGGTCAAAC 

51 TATTGTGGCA GCAGTCAATG AGTCCGACGA TCTGGAGCTT GTTGCAGAGA 

101 TCGGCGTCGA CGATGATTTG AGCCTTCTGG TAGACAACGG CGCTGAAGTT 

151 GTCGTTGACT tcaccactcc taacgctgtg atgggcaacc TGGAGTTCTG 

201 CATCAACAAC GGCATTTCTG CGGTTGTTGG AACCACGGGC TTCGATaATG 

251 CTCGTTTGGA GCAGGTTCGC GcCTGGCTTG AAGGAAAAGA CAATGTCGGT 

301 GTTCTGATCG CACCTAACTT TGCTATCTCT GCGGTGTTGA CCATGGTCTT 

351 TTCCAAGCAG GCTGCCCGCt TCTTCGAATC AGCTGAAGTT ATTGAqCTGC 

401 ACCACCCCAA CAAGCTGGAT GCACCTTCAG GCACCGCGAT CCACACTGCT 

451 CAGGGCA.TTG GTGCGGCACG CAAAGAAGCA GGCATGGACG CACAGCCAGA 

501 TGCGACCGAG CAGGCACTTG AGGGTTCCCG TGGCGCAAGC GTAGATGGAA 

S%1 TCCCaGTTCA cGCAGTCCGC ATGTCCGGCA TGGTTGCTCA CGAGCAAGTT 

601 ATCTTTGGCA CCCAGGGTCA GACCTTGACC ATCAAGCAGG ACTCCTATGA 

651 TCGCAACTCA TTTGCACCAG GTGTCTTGGT GGGTGTGCGC AACATTGCAC 

"7 01 AGCACCCAGG CCTAGTCGTA GGACTTGAGC ATTACCTAGG CCTGTAA 




ATGGGAATCAAGGTTGGCGTTCTCGGAGCCAAAGGCCGTGTTGGTCAAACTATTGTGGCA 

I + + + ,..._+ + — . — ,, — + 60 

M G I KVGVLGAKGP. VGQTIVA 

GCAGTCAATGAGTCCGACGATCTGGA.GCTTGTTGCAGAGATCGGCGTCG.'-.CGATGATTTG 
cu 4 + h t .+ 12o 

AVNESDDLELVAEtGVDDD L 



121 



AGCCTTCTGGTAGACAACGGCGCTGAAGTTGTCGTTGACTTCACCACTCCTAACGCTGTG 



180 



SLLVDNGAEVVVDFTTPNAV 



ATGGGCAACCTGGAqTTCTGCATCAACAACGGCATTTCTGCGGTTGTTGGAACCACGGGC 
181 + __ + + + + + 24Q 

MGNLEFCIN HG ISAVVGTT G 

TTCGATaATGCTCGTTTGGAGCAGGTTCGCGcCTGGCTTGAAGGAAAAGACAATGTCGGT 
2 <1 + + + + + . + 300 

FDNAftliEQVRAWLEGKD-NV G 

GTTCTGATCGCACCTAACTTTGCTATCTCTGCGGTGTTGACCATGGTCTTTTCCAAGCAG 
301 — . + _ + < + 1 + 350 

VLIAPNFAISAVLTMVFSKQ 

\ 

GCTGCCCGCTTCTTCGAATCAGCTGAAGTTATTGAGCTGCACCACCCCAACAAGCTGGAT 
361 + + - + + + + 420 

AARTFESAEVlSuHHPNKLD 

GCACCTTCAGGCACCGCGATCCACACTGCTCAGGGCATTGCTGCGGCACGCAAAGAAGCA 
421 <• + 1 + + + 460 

APSGTAIHT AQGIAAARKEA 

GGCATGGACGCACAGCCAGATGCGACCGAGCAGGCACTTGAGGGTTCCCGTGGCGCAAGC 
481. + . + +- ■ + + . + 540 

GMDAQPDATEQALEGSR'G'A'S 

GTAGATGGAATCCCaGTTCAcGCAGTCCGCATGTCCGGCATGGTTGCTCACGAGCAAGTT 
541 ■ + + ■ — + + -+ — - — ■ + 600 

VDG I PVHAVRMSGHVAHEQV 

ATCTTTGGCACCCAGGGTCAGACCTTGACCATCAAGCAGGACTCCTATGATCGCAACTCA 
601 .- + +- — + + + * 660 

iFGTQGQTtTIKQDSYDRNS 

TTTGCACCAGGTGTCTTGGTGGGTGTGCGCAACATTGCACAGCACCCAGGCCTAGTCGtA 
661 + + +--- . + . . + + 72Q 

FAFGVLVGVRNIAQHPGLVV 
GGACTTGAGCAl'TACCTAGGCCTGTAA 

->zi — . + + lA1 

GLEHYLGt*- 



uclcofcj.de. secuance oC cidW (SfiQ ID NO-.X) 



1 ATGCATTTCG GTAAGCTCGA CCAGGACAGT GCCACCACAA TTTTGGAGGA 

51. TTACAAGAAC ATGACCAACA TCCCCGTAGC TATCGTsGGC TACGGAAACC 

101 TGGGACGCAG CGTCGAAAAG CTTATTGCCA AGCAGCCCGA CATGGACCTT 

151 GTAGGAATCT TCTCGCGCCG GGCCACCCTC GACACAAAGA CGCCAGTCTT 

201 TGATGTCGCC GACGTGGACA AGCACGCCGA CGACGTGGAC GTGCTGTTCC 

251 TGTGCATGGG CTCCGCCACC GACATCCCTG AGCAGGCACC AAAGTTCGCG 

301 CAGTTCGCCt GCACCGTAGA CACCTACGAC AACCACCGCG ACATCCCACG 

3S1 CCACCGCCAG GTCATGAACG AAGCCGCCAC CGCAGCCGGC AACGTTGCAC 

4 01 TGGTCTCTAC CGGCTGGGAT CCAGGAATGT TCTCCATCAA CCGCGTCTAC 

4 SI GCAGCGGCAG TCTTAGCCGA GCACCAGCAG CACACCTTCT GGGGCCCAGG 

501 TTTGTCACAG GGCCACTCCG ATGCTTTGCG ACGCATCCCT GGCGTTCAAA 

551 AGGCcGTCCA GTACACCCTC CCATCCGAAG AaGCCCTGGA AAAGGCCCGC 

601 CGTGGCGAAG CCGGCGACCT cACCGGAAAG CAAACCCACA AGCGCCAATG 

651 CTTCGTGGTT GCCGACGCGG CCGAcCACGA GCGCATCGAA AACGACATCC 

701 GCACCATGCC TGATTACTTC GTTGGCTACG AAGTCGAAGT CAACTTCATC 

151 GACGAAGCAA CCtTgGACgC CGAGCACACC GGCATGCCAC ACGGcGGaCA 

SOI CGTGATcACC ACCGGCGACA CCGGTGGCTT CAACCACACC GTGGAATACA 

351 TCCTgAAGCT GGACCGAAAC CCAGATTTCA CCGCTTCtTC ACAGATCGCT 

901 TTCGGcCGCG CAGCTCACCG CATGAAGCAG CAGGGCCAAA GCGGtGCTTT 

951 CACCGTCCTC GAAGTTGCTC CATACtTGCT CTCCCCgGAG AACTTGGAtG 

1001 ATCTGATCGC ACGCGACGTC TAA 




A" "/>T?VtJi;C.T.V-/aCTCGACCJ--V^V.^ 

■ 1© M'J/^CR.^CATCIG^TACCTATCCTi^CCTftCCf.rW^CTGCGACCCfiGCSTCCftAMG 
H T U t r V A t V G Y C H I. C B S V e K 
crrATTGCCAAGCfiGCCCCACATCWfeCCTTCTACCAATCTTCTCOCGCCCCCCCACCCTC 



LIAKQPDH6LVCI r S R R ri T t. 
GACACAAAGACCCCAGTCTTTGATGTCGCCGACCTGGACAAGCACCCCCACGACCTGGftG 



GTGCTCTTCCTGTGl^TGCGCTCCCCCACCCACATecCTlSRGCACG^CCAAAGTTCGCG 
L v * * ——+-,.. , , 3 

vLrLCHCSATOIFCQAP KFA 

CACTTCGCCTGCACCGTAGACACCTACGACflACCRCCGCGACATCCCACGCCRCCCCCAG 



hTGAACGAAGCCGCCACCCCAGCCGGCAACGTTCCACTGGTCTCTACCCGCTGGGAT 



VHHEAATAfiCNVALVST = WD 

ccag5aatgttctccatcaaccgcgtctacgcagcggcactcttaggcca;-caccagcag 



PGHTSIHRVYAAAVLAE 
CACACGTTCTGGGGCCCAGClTTGTCACAGGGCCACTCCGATCCTTTGCGJ 



H-fViGPGtSQGHSO. ALU* TP 

GGCCTTC?.\MGGCcGTCCAGTACACCCTCCCATCCCW.GA3GCCCTGC;.-.=AGGCCCGC 

C 'QKAVQYTLPSEEALE rlAR 
CGTCSCGAAGCCCCCGACCTCACCGGAAAGCAAACCCACAAGCGCCAAT 



CCCGAeGCGGCCGAcCACGAGCGCATCCAAAACGACATCCGCACCATCC^TjRTTACTTC 



ABRADHERICNDI RTMf 2 Y f- 
^ ^ ^ GTTGGCTACGAAGTCGAAGTCMCTTCATCGACGAAGCAAcCTTgGACsCZCSAGCACACC 

GGCATGCCACACGGcGGaCACGTGATcACCACCCGCGAC ACCGGTGGCTT - AACCACACC 
7 6l , 4 + „ + _ «•_ _^ b 

GMP1IGCHVITTCDTCCF;: 11 t 
GTGGftATACATCCtgAACCrGGACCGAAACCCAGATTTCACCCCTTCtTrACACATCCCT 



O R N f 0 T 



TTCGG=-GCftCAGCTCACCCCA: 



'TCAAGCAGCAGGGCCAAAGCGGtGCTTT^CCGTCCTC 



1- G R A A » « H K 0 C '.• S G A F V L 

GftAGTTGCTCCATACtTGCTCTCCCCqG/-.CAACTTGG.i-,tGATCTGATCGCA-<:':CGf-;GTC 
9G1 + <■ f + +^ + 

EVAPYLLSPEHLDDLIAp. DV 
TAA 

1021 1023 




Sequence o£ full length LysA from NHEL B-11474 (SEQ ID NO; 
11) Underlined region; the priming site for lysA primer 



CGCCAAGAAC^CGGCGTTGTCACCGTCGCTGGTOTCCCTCTGCCTGACCTCGCTGAAGAA 

TACGGAACOTCACTGTTCGTAGTCGAe<W3GACGAOTrcCG'K^ 

GCTACCS3CA3>rC^T6<^CA©SCAATGTG(^ 

ACCATTGC»C<3TTG<3OTTGATGAAGAG<K3<K:^ 

CT<KGCATOGCCCTGGCCGCTGGTTTCCCGGCCAGCra^ 

AAAGGCGTAGAGTTCCTGCGXXKGTTGGTTCA&AACGGTGTCGGGCATGtoMTGCTG 

TCCX3C<3CAaGAATTGGAACTGCTGGAT^CGTTGCC<^G6TlSftAGG^ 

GTGTTGATCCGCGTGAAKCA^TATCfiAAGCCCACACCCACGAG'PrCATCGCCACTAGC 




ATCC^CAGCGAACTAGGTGTCGCCCTTCCTGAGOTC5GACCMG<3lX5GCG{^TACGGCATC 

ggctagactgcagatgaggaaccactca acgtcgcagargtcgcctccgaccta ctcacc 
gcagtcggaaaaatggcagcggaactaggcatcgacgcaccaaccotgcttgttgagccg 
gggcgcgctatckcaggcccctccaccgtgaccatctacgaagtcgggaccaccaaaaac 
c?tccacctagacgacqacaaaacccx3ccggtacgtagccgk(^c(mag0catgtccgac 
aacatca2cccag<^ctctacggctccga^^ 

gaaggagacccagtaa(k^cccgcatcgtgggctcggactgcgaatccg<^ga5atcct(3 

ATCAACGATGAAATCTACCCATCTGAGATCACCAGCGGCGACTTCCTCtJCACTCGCAGCC 
ACCGGCGCATACTGCTACGCCATGAGCTCCCGCTACAACGCCTTCACACGGCCCGCCGTC 
GTGTCCGTC(^(^G<K^G<^CC<3CCTCATGOTKX3CCGCGAA&CCOT 
CTCTCACTAGAGGCATAA 




DIAMTNOPIMELATE DECARBOXYLASE S ^ t Ci OO'.tX 

HEDQKFGFS^SGSATEAAKAAHHA^^ 
GRAlAGFaTVTXYIWGTTKNVHmrc^ 
VSVRAGSSELMLRRETLDDILSLSA 



P^. 13 



:lssotide sequences of AS01D lysA (SEQ ID NO: 

1 ATGGCTACAG TTGAAAATTT CAATG.'-ACT"? CCCGCACACG TATGGCCACS 
51 CAATGCCGTG CGCCAAGAAG ACGGCTfTGT CACCGTCGCT GGTGTGCCTC 
101 TGCCTGACCT CGCTGAAGAA TACGGAACCC CACTGTTCGT AGTCGACGAG 
151 GACGATTTCC GTTCCCGCTG TCGCG.-.CATG GCTACCGCAT TCGGTGGACC 
201 AGGCAATGTG CACTACGCAT CTAAAGCGTT CCTGACCAAG ACCATTGCAC 
251 GTTGGGTTGA TGAAGAGGGG CTGGCACTGG ACATTGCATC CATCAACGAA 
301 CTGGGCATTG CCCTGGCCGC TGGTTTCCCC GCCAGCCGTA TCACCGCGCA 
3S1 CGGCAACAAC AAAGGCGTAG AGTTCCTGCG CGCGTTGGTT CAAAACGGTG 
401 TGGGACACGT GGTGCTGGAC TCCGCACAGG AACTAGAACT GTTGGATTAC 
4 51 GTTGCCGCTG GTGAAGGCAA GATTCAGGAC GTGTTGATCC GCGTAAAGCC 
501 AGGCATCGAA GCACACACCC ACGAGTTCAT CGCCACTAGC CACGAAGACC 
551 AGAAGTTCGG ATTCTCCCTG GCATCCGGTT CCGCATTCGA AGCAGCAAAA 
601 GCCGCCAACA ACGCAGAAAA CCTGAACCTG GTTGGCCTGC ACTGCCACGT 
651 TGGTTCCCAG GTGTTCGACG CCGAAG3CTT CAA.GCTGGCA GCAGAACGCG 
701 TGTTGGGCCT GTACTCACAG ATCCACAGCG AACTGGGCGT TGCCCTTCCT 
751 GAACTGGATC TCGGTGGCGG ATACG3CATT GCCTATACCG CAGCTGAAGA 
801 ACCACTCAAC GTCGCAGAAG TTGCCTCCGA CCTGCTCACC GCAGTCGGAA 
851 AAATGGCAGC GGAACTAGGC ATCGACGCAC CAACCGTGCT TGTTGAGCCC 
901 GGCCGCGCTA TCGCAGGCCC CTCCACCGTG ACCATCTACG AAGTCGGCAC 
951 CACCAAAGAC GTCCACGTAG ACGACGACAA AACCCGCCGT TACATCGCCC- 
1001 TGGACGGAGG CATGTCCGAC AACATCCGCC CAGCACTCTA CGGCTCCGAA 
1051 TACGACGCCC GCGTAGTATC CCGCTTCGCC GAAGGAGACC CAGTAAGCAC 
1101 CCGCATCGTG GGCTCCCACT GCGAATCCGG CGATATCCTG ATCAACGATG 
1151 AAATCTACCC ATCTGACATC ACCAGCSGCG ACTTCCTTGC ACTCGCAGCC 
12 01 ACCGGCGCAT ACTGCTACGC CATG«CTCC CGCTACAACG CCTTCACACC- 
12 51 GCCCGCCGTC GTGTCCGTCC GCGCTGGCAG CTCCCGCCTC ATGCTGCGCTC 
1301 GCGAAACGCT CGACGACATC CTCTC^CTAG AGGCATAA 



ATGGCT ACAGTTGAAAATTTCAATGAACTTCCCGCACACGTATGGCCA.CGC.-.-.TGCCGTC 
, , < , + + , GQ 

MAT V E N FNELPAHVW P R A V 

CGCCAAGAAGACGGCGTTGTCACCGtCGCTGGTGTGCCTCTGCCTGACCTCGCTGAAGAA 



+ + + . l2Q 

RQEDGVVTVAGVPLPDLAEE 
TACGGAACCCCACTGTTCGTAGTCGACGAGGACGATTTCCGTTCCGGCTGTCGCGACATG 



1^1 + + + 4- + + 180 

YGT PLFVVD -EODFRSRCRDM 

GCTACCGCATTCGGTGGACCAGGCAATGTGCACTACGCATCTAAAGCGTTCCTGACCAAG 
181 + + + +— + ^+ 24 0 

ATAFGGPGNVHYASKAFLTK 

accattgcacgttgggttgatgaagaggggctggcactggacattgcatccatcaacgaa 

241 + — + +■ + + + 300 

TIARWVDEEGLALDIASINE 

CTGGGCATTGCCCTGGCCGCTGGTTTCCCCGCCAGCCGjfATCACCGCGCACGGCAACAAC 
301 + +. + + + + 360 

L G I ALAAG FPASRI TAHGN N 

AAAGGCGTAGAGTTCCTGCGCGCGTTGGTTCAAAACGGTGTGGGACACGTGGTGCTGGAC 
361 * + + + + ,j 20 

KGV EFLRALVQNGVGHVY. LD 

TCCGCACAGGAACTAGAACTGTTGGATTACGTTGCCGCTGGTGAAGGCAAG.-.TTCA.GGAC 



saqelelldyvaagegki'q D 

GTGTTGATCCGCGTAAAGCCAGGCATCGAAGCACACACCCACGAGTTCATCGCCACTAGC 
481 + + + + + + 5AQ 

V LI RVKPGIEAHTH E F I T S 

CACGAAGACCAGAAGTTCGGATTCTCCCTGGCATCCGGTTCCGCATTCGA&GCPGCAAAA 
5^1 + + + + + + 60Q 

h edqkfgfslasgsafe.-. a k 
gccgccaacaacgcagaaaacctgaacctggttggcctgcactgccacgttggttcccag 

601 + + * 1 + + 6SQ 



AANNAENL 



NLVGLHCHVGSQ 



GTGTTCGACGCCGAAGGCTTCAAGCTGGCAGCftGAACGCGTGTTGGGCCTGTACTCACAG 



ATCCACAGCGAACTGGGCGTTGCCCTTCCTGAACTGGATCTCGGTGGCGGATACGGCATT 
721 *. + + + + + 180 

I HSE1.GVALPELDLGGGVG I* 

GCCTATACCGCAGCTGAAGAfi.CCACTCAACGTCGCAGAAGTTGCCTCCGACCTGCTCACC 
781 ~ — 4- ^ + +' — + + 840 

AYTAAEEPL UVAEVASD r%i L X 

. ■ 

GCRGTCGGAAAAATGGCAGCGGAACTAGGCATCGACGCACCAACCGTGCTTGTTGRGCCC 
841 + -t- + + +---■ + 900 

AVGKMAAELGIDAP. TVLVEP 

GGCCGCGCTATCGCAGGCCCCTCCACCGTGACCATCTACGAAGTCGGCACCACCAAAGAC 
901 + ^ + + — + + 960 



GTCCACGTAGACGACGACAAAACCCGCCGTTACATCGCCGTGGACGGAGGCATGTCCGAC 
961 ■ +■ + + - + + — + 1020 

V HVD DDKTRRY IAV0GGMS D 

AACATCCGCCCAGCACTCTACGGCTCCGAATACGACGCCCGCGTAGTATCCCGCTTCGCC 
1021 + + + 7 + + + 1080 

N IRPALYGSEYDARVVSRE-ft 

GAAGGAGACCCAGTAAGCACCCGCATCGTGGGCTCCCACTGCGAATCCGGCGATATCCTG 
1081 + - + + *■ — - ■ + mo 

EGDPVSTRIVGSHCESGDI L 

ATCAACGATGAAATCTACCCATCTGACATCACCAGCGGCGACTTCCTTGCACTCGCAGCC 
1141 -• + + + + + " + 1200 

I NDS IYPSD1TSGDFLALAA 

ACCGGCGCATACTGCTACGCCATGAGCTCCCGCTACAACGCCTTCACACGGCCCGCCGTC 
1201 + + + + + + 1260 

T GAYCYAMSSRYNAFTRPAV 

GTGTCCGTCCGCGCtGGCAGCTCCCGCCTCATGCTGCGCCGCGAAACGCTCGA.CGACATC 
1261 + + + + + + 1320 



RAGSSRLMLRRBT 



sequence o£ arf2 in dapBA opocon 



1 GTGGCCGAAC AAGTTAAATT GAGCGTGGAG TTGATAGCGT GCAGTTCTTT 

51 TACTCCACCC GCTGATGTTG AGTGGTCAAC TGATCTtGAG GGCGCGGAAG 

X01 CACTCGTCGA GTTTGCGGGT CGTGCCTGCT ACGAAACTTT TGATAAGCCG 

15 L AACCCTCGAA CTGCJTCCAA TGCTGCGTAT CTGCGCCACA TCATGGAAGT 

201 GGGGCACACT GCTTTGCTTG AGCATGCCAA TGCCACGATG TATATCCGAG 

251 GCATTTCTCG GTCCGCGACC CATGAATTGG TCCGACACCG CCATTTTTCC 

301 TTCTCTCAAC TGTCTCAGCG TTTCGTGCAC AGCGGAGAAT CGGAAGTAGT 

351 GGTGCCCACT CTCATCGATG AAGATCCGCA GTTGCGTGAA CTTTTCATGC 

401 ACGCCATGGA TGAGTCTCGG TTCGCTTTCA ATGAGCTGCT TAATGCGCTG 

4 51 GAAGAAAAAC TTGGCGATGA ACCGAATGCA CTTTTAAGGA AAAAGCAGGC 

501 TCGTCAAGCA GCTCGCGCTG TGCTGCCCAA CGCTACAGAG TCCAGAATCG 

551 TGGTGTCTGG AAACTTCCGC ACCTGGAGGC ATTTCATTGG CATGCGAGCC 

601 AGTGAACATG CAGACGTCGA AATCCGCGAA GTAGCGGTAG GATGTTTAAG 

G51 AAAGCTGCAG GTAGCAGCGC CAACTGTTTT CGGTGATTTT GAGATTGAAA 

7 01 CTTTGGCAGA CGGATCGCM ATGGCAACAA GCCCGTATGT CATGGACTTT 

751 TAA 




(jo*, to, aw^ o^J^ s^u^<^_ 

GTGGCCGAACAAGTTAAATTGAGCGTGGAGTTGATAGCGTGCAGTTCTTTTACTCCACCC 
MAEQVKLSVEL I A CSS FT =• p 
GCTGATGTTGAGTGGTCAACTGATGTTGAGGGCGCGvAAGCACTCGTCGAGTT72CGGGT 



CGTGCCTGCTACGAAACTTTTGATAAGCCGAACCCTCGAACTGCTTCCAATGCTGCGTAT 
121 + + + + + + 

RACY ETFDKPHPRTASNAAY 
CTGCGCCACATCATGGAAGTGGGGCACACTGCTTTGCTTGAGCATGCCAATGCCACGATG 



241 



LRHI MEVGHT-ALLEHANATM 
TATATCCGAGGCATTTCTCGGTCCGCGACCCATGAATTGGTCCGACACCGCCATTTTTCC 



Y 1 RG ISRSATHELVRHRHF 



+ _ H + + + + 3g0 

FSQLSQRFVHSGESEVVVpT 



361 



CTCATCGATGAAGATCCGCAGTTGCGTGAACTTTTCATGCACGCCATGGATGAGTCTCGG 

LI DEDPQLRELFMHAMDESR 
TTGGCTTTCAATGAGCTGCTTAATGCGCTGGAAGAAAAACTTGGCGATGAACCGAATGCA 



420 



FA FN ELLNALEEKLGDEPNA 

CTTTTAAGGAAAAAGCAGGCTCGTCAAGCAGCTCGCGCTGTGCTGCCCAACGCTACAGAG 
481 + + + + + + 54Q 

L L RKKQARQAARAVLPn'at"e 

TCCAGAATCGTGGTGTCTGGAAACTTCCGCACCTGGAGGCATTTCA'TTGGCATGCGAGCC 
541 + < +— + + + 600 

S R IVVSGNFRTWRH-FIGMP. A 
AGTGAACATGCAGACGTCGAAATCCGCGAAGTAGCGGTAGGATGTTTAAGAAAGCTGCAG 

SEHADVEIREVAVGCLRKLQ 
^ GTAGCAGCGCCAACTGTTTTCGGTGATTTTGAGATTGAAACTTTGGCAGACGGATCGCAA 

VAAPTVFGDFEIETLADGSQ 

ATGGCAACAAGCCCGTATGTCATGGACrTTTAA 
7 2l «. + + _ 753 



601 



660 



MATS P Y V 



I-IALWQKYGG SSL.ESAERIR 



ATCC 13032 
N13 

ATCC 21529 
Consensus 



ATCC 13032 
M13 

ATCC 21529 
consensus 



ATCC 13032 
N13 

ATCC 21529 
Consensus 



ATCC 13032 
N13 

ATCC 21529 
Consensus 



ATCC 13032 
N13 

ATCC 21529 
Consensus 



ATCC 13032 
N13 

ATCC 21529 
Consensu 



'. KACNDWW.C S.-v.GDTTDEL 
100 



LELAAAVNPV PPAREMDMLL. TAGSRISNAk VAMAXESLGA EAQSFTGSQA 
101 150 



•GVLTTERHGN ARIVDVTPGR VREALDEGKI CIVAGFQGVN KETRDVTTLG 
151 200 



RGGSDTTAVA ItAAALNADVC EIVSOVDGW TADPRIVPNA QrXEKLSFEE 
201 2SQ 



ML.ELAAVGSK ILVLRSVEVA RAFSV PLRV?, SSYSNDPGTL IAGSMSDIPV 
251 ' 300 



EEAVLTGVAT DKSEAKVTVt, GISDKPGEAA KVFRALADAE XKIDMVLQNV 
301 . 350 



ATCC 13032 
N13 

ATCC 21529 
Consensus 



ATCC 13032 
N13 

ATCC 21529 
Consensus 



SSVEDCTTOI TFTCPRADGR RAtfilLKKL-3 
351 ~< r " 



NKTNVLY D^VGKVSLV 



GAGMKSHPGV TAEFMEALRD VNVjS'tELIST SSIRISVL.IR EDDLDAAARA 
401 421 



LHEQFQLGGE DEAWYACTG R 




AACCGGTGTGGA<K:CGACXlkTTO 

GCCAGTTCATGGATTGGCTGCCGAAGAAGCTATAG 

GGTGC^GTGCirrKXKOTQGG^GGGAO^T^ 

AATAGTCCATG<KiCACGTGGATGCT(KX^CAT^ 

CAG<XKMTGTTTTCrrrGCGCTGCTCK^ 

CAAGAACTCXTTACnACClTITlTATrGTC 

GTTACCCCAAAAAGCATATACAGAGACCAATGATTITTCATrAAAAAGGCAG 
CrraTTATTCnXJTGCGAC^GGTGTACXrrCGGCTAGAAT^ 





j£_ —J 



4 




4L 



ear 




fig. ^ 



PCI?.- DMA 




Mz- — _) 



\ 1 *M i frPH-i- f 



....... ,.pr<&ni^(y;. . f- 



1 GTGGCCGAAC AAGTTAAATT GAGCGTGGAG TTGATAGCGT GCAGTTCTTT 

51 TACTCCACCC GCTGATGTTG AGTGGTCAAC TGATGTTGAG GGCGCGGAAG 

101 CACTCGTCGA GtTTGCGGGT CGTGCCTGCT ACGAAACTTT TGATAAGCCG 

151 AACCCtCGAA CTGCTTCCAA TGCTGCGTAT CTGCGCCACA TCAtGGAAGT 

201 GGGGCACACT GCTTTGCTTG AGCATGCCAA TGCCACGATG TATATCCGAG 

251 GCATTTCTCG GTCCGCGACC CATGAATTGG TCCGACACCG CCATTTXTCC 

301 TTCTCTCAAC TGTCTCAGCG TTTCGTGCAC AGCGGAGAAT CGGAAGTAGT 

351 GGTGCCCRCT CTCAT' 



121 



GTGGCCGAACAAGTTAAATTGAGCGTGGAGTTGftTftGCGTGCAGTTCTTTTACTCCACCC .V. 
+ + „<.. + + ^ 

MAEQVKLSVELI/-. CSSFT=-p 

GCTGATGTTGAGTGGTCAACTGATGTTGAGGGCGCGGAAGCACTCGTCGAGTTTGCGGGT 
L . < 1 +, ^. ^ l2Q 

ADVEWSTOVEGAEALVEFAG 

CGTGCCTGCTACGAAACTTTTGATAAGCCGftACCCTCGAACTGCTTCCAATGCTGCGTAT 

+ _ + + + — + + 180 

RACY ETFDKPHPRTASNAAY 

CTGCGCCACATCATGGAAGTGGGGCA.CACTGCTTTGCTTGAGCATGCCAATGCCA.CGRTG 
+ „ . + l+ + — . _ + 

LRHIMEVGKTALLEHANA.TM 



TATATCCGAGGCATTTCTCGGTCCGCGACCCATGAATTGGTCX;GACACCGCCA'I"rTTTCC 

VIRGISRSATHEL v'r H R H F S 
TTCTCTCAACTGTCTCAGCGTTTCGTGCACAGCGGAGRATCGGAAGTAGTGGTGCCCACT 



300 



301 



+- + + 4- -~ + + 360 

QLSQRPVHSGESEVVVPT 
CTCAT ... 



361 

L (I) 



fvq. 



^a** 2.0 



TACraGAACCCCAC^TTOSTAGTCGACS^^ 

ACCATTGCACGirrGGGl'TGATGAAQAGGGGCTaaCACTGGACATTQCXJTCC^^ 

CTCGGCATTOCCOTGGCCGCTGGl^K^CGGCCAC^CG 

AAAGGCGmGAGlTCCtGCGCGCGTTGGTT^^ 

TCCGCGC^GGAAO'TGGAAGrGCTGGATTACGTTGC 

OTQrimTCCGCGrGAAOCau^TATCGAAGC^CACACCCA^ 

CACGAAGACCAGARGT0XX5SAT1H^CCTGGCATCCGG^ 

GCAGCC^tlAATGCAGAGAAC^rTGAACCTGGTT 

GTG^GAQGCCGAAGG(?miAAGCT<3<2CAGCA^ 

ATCCACAGCGAACTAGGrGTH2G<^CTrCCTGAG^ 

GGglACAGTQCAGATGAGGA&CC^GTCA ACGTOGCAGAJU^ , ■ 




DIAMZNOPXHEIiATE DECAHBOXYLftjSE 



Fig, ^ 



